OM nucleic - nucleic search, using sw model

Run on:         March 5, 2004, 16:40:19 ; Search time 6893 Seconds
                                          (without alignments)
                                          11343.513 Million cell updates/sec

Title:          US-09-668-314C-1
Perfect score:  1804
Sequence:       1 atgggcgcactggcccgggc ... aaaccaaaaaaaaaaaaaaa 1804

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       3470272 seqs, 21671516995 residues

Total number of hits satisfying chosen parameters:      6940544

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
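
The throughput figure in the header can be checked with a few lines of arithmetic. The sketch below assumes that one cell update is one query residue compared against one database residue, and that both strands of every database sequence were searched. The second assumption is an inference from the hit count being exactly twice the number of sequences searched; the report does not state it. The variable names are illustrative.

```python
# Values copied from the report header.
query_length = 1804            # length of the query (equal to the perfect score)
db_residues = 21_671_516_995   # "Searched: ... residues"
db_sequences = 3_470_272       # "Searched: ... seqs"
total_hits = 6_940_544         # hits satisfying the chosen parameters
search_seconds = 6893          # "Search time"

# Assumption: hits per database sequence equals the number of strands searched.
strands = total_hits // db_sequences

# Assumption: one cell update = one query residue vs. one database residue.
cell_updates = query_length * db_residues * strands
million_updates_per_sec = cell_updates / search_seconds / 1e6

print(strands)
print(round(million_updates_per_sec, 3))
```

Under these assumptions the computed rate agrees with the reported 11343.513 million cell updates per second.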

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

RESULT 1
BD235885
LOCUS       BD235885     1804 bp    DNA     linear   PAT 17-JUL-2003
DEFINITION  Alzheimer's disease secretase.
ACCESSION   BD235885
VERSION     BD235885.1  GI:33045655
KEYWORDS    JP 2002526081-A/1.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Gurney,M.E., Bienkowski,M.J., Heinrikson,R.L., Parodi,L.A. and
            Yan,R.
  TITLE     Alzheimer's disease secretase
  JOURNAL   Patent: JP 2002526081-A 1 20-AUG-2002;
            PHARMACIA AND UPJOHN CO
COMMENT     OS   Homo sapiens (human)
            PN   JP 2002526081-A/1
            PD   20-AUG-2002
            PF   23-SEP-1999 JP 2000574268
            PR   24-SEP-1998 US 60/101594
            PI   MARK E GURNEY,MICHAEL JEROME BIENKOWSKI,ROBERT LEROY
            PI   HEINRIKSON,
            PI   LUIS A PARODI,RIQIANG YAN
            PC   C12N15/09,A61K45/00,A61P25/28,C07K14/47,C07K16/18,C12N1/15,
            PC   C12N1/19,
            PC   C12N1/21,C12N5/10,C12N9/64,C12P21/02,C12P21/08,C12Q1/37,G01N33/
            PC   15,
            PC   G01N33/50//(C12N1/21,C12R1:19),C12N15/00,C12N5/00
            CC   Alzheimer's disease secretase
            FH   Key             Location/Qualifiers
            FT   source          1..1804
            FT                   /organism='Homo sapiens (human)'.
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
ORIGIN

  Query Match             100.0%;  Score 1804;  DB 6;  Length 1804;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 1804;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;
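
The score reported above follows from the search settings in the header: with an identity scoring table, a full-length exact match of the 1804-base query scores 1804. As a rough illustration only, the sketch below computes a Smith-Waterman local alignment score with affine gap penalties. It is not the search program's own code. The match value of 1 is implied by the perfect score; the mismatch value of 0 and the convention that a gap of length k costs Gapop + k x Gapext are assumptions, and the function name is hypothetical.

```python
def sw_score(query, target, match=1.0, mismatch=0.0, gap_open=10.0, gap_ext=1.0):
    """Best local alignment score (Smith-Waterman with affine gaps)."""
    query, target = query.upper(), target.upper()
    neg_inf = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)       # best scores for the previous query row
    f_prev = [neg_inf] * (n + 1)   # previous row, alignment ending in a gap in the target
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg_inf] * (n + 1)
        e = neg_inf                # current row, alignment ending in a gap in the query
        for j in range(1, n + 1):
            # Extend an existing gap, or open a new one (assumed cost: open + extend).
            e = max(e - gap_ext, h_cur[j - 1] - gap_open - gap_ext)
            f_cur[j] = max(f_prev[j] - gap_ext, h_prev[j] - gap_open - gap_ext)
            s = match if query[i - 1] == target[j - 1] else mismatch
            # Local alignment: a cell never drops below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


print(sw_score("ACGTACGT", "ACGTACGT"))
print(sw_score("ACGTACGT", "ACGTTCGT"))
```

With these parameters two identical sequences of length n score n, which is the situation reported for this hit.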

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804



RESULT 2
AR224092
LOCUS       AR224092     1804 bp    DNA     linear   PAT 26-SEP-2002
DEFINITION  Sequence 1 from patent US 6440698.
ACCESSION   AR224092
VERSION     AR224092.1  GI:23332752
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Gurney,M.E., Bienkowski,M.J., Heinrikson,R.L., Parodi,L.A. and
            Yan,R.
  TITLE     Alzheimer's disease secretase, APP substrates therefor, and uses
            therefor
  JOURNAL   Patent: US 6440698-A 1 27-AUG-2002;
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="unknown"
                     /mol_type="genomic DNA"
ORIGIN

  Query Match             100.0%;  Score 1804;  DB 6;  Length 1804;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 1804;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804





RESULT 3
AR269223
LOCUS       AR269223     1804 bp    DNA     linear   PAT 10-APR-2003
DEFINITION  Sequence 1 from patent US 6500667.
ACCESSION   AR269223
VERSION     AR269223  GI:29700191
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Gurney,M.E., Bienkowski,M.J., Heinrikson,R.L., Parodi,L.A. and
            Yan,R.
  TITLE     Aspartyl protease 2 (Asp2) antisense oligonucleotides
  JOURNAL   Patent: US 6500667-A 1 31-DEC-2002;
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="unknown"
                     /mol_type="genomic DNA"
ORIGIN

  Query Match             100.0%;  Score 1804;  DB 6;  Length 1804;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 1804;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804





RESULT 4
AX105383
LOCUS       AX105383     1804 bp    DNA     linear   PAT 30-APR-2001
DEFINITION  Sequence 1 from Patent WO0123533
ACCESSION   AX105383
VERSION     AX105383.1  GI:13921510
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Gurney,M. and Bienkowski,M.J.
  TITLE     Alzheimer's disease secretase, app substrates therefor, and uses
            therefor
  JOURNAL   Patent: WO 0123533-A 1 05-APR-2001;
            Pharmacia & Upjohn Company (US)
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
ORIGIN

  Query Match             100.0%;  Score 1804;  DB 6;  Length 1804;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches 1804;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804



RESULT 5 
AX573821 
LOCUS       AX573821                1804 bp    DNA     linear   PAT 07-JAN-2003
DEFINITION  Sequence 1 from Patent EP1249498
ACCESSION   AX573821
VERSION     AX573821 GI:27551476
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Gurney,M. and Bienkowski,M.J.
  TITLE     Alzheimer's disease secretase, app substrates therefor, and uses
            therefor
  JOURNAL   Patent: EP 1249498-A 1 16-OCT-2002;
            PHARMACIA & UPJOHN COMPANY (US)
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
ORIGIN

Query Match 100.0%; Score 1804; DB 6; Length 1804;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240



Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy       1801 AAAA 1804
              ||||
Db       1801 AAAA 1804





RESULT 6 
AF200342 
LOCUS       AF200342                1804 bp    mRNA    linear   PRI 12-DEC-1999
DEFINITION  Homo sapiens chromosome 21 aspartyl protease 1 mRNA, complete cds.
ACCESSION   AF200342
VERSION     AF200342.1  GI:6561811
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1804)
  AUTHORS   Yan,R., Bienkowski,M.J., Shuck,M.E., Miao,H., Tory,M.C.,
            Pauley,A.M., Brashier,J.R., Stratman,N.C., Mathews,W.R., Buhl,A.E.,
            Carter,D.B., Tomasselli,A.G., Parodi,L.A., Heinrikson,R.L. and
            Gurney,M.E.
  TITLE     Membrane-anchored aspartyl protease with Alzheimer's disease
            beta-secretase activity
  JOURNAL   Nature 402 (6761), 533-537 (1999)
  MEDLINE   20057170
   PUBMED   10591213
REFERENCE   2  (bases 1 to 1804)
  AUTHORS   Bienkowski,M.J., Shuck,M.E., Slightom,J.L. and Drong,R.F.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-OCT-1999) Genomics Research, Pharmacia&Upjohn, 301
            Henrietta, Kalamazoo, MI 49007, USA
FEATURES             Location/Qualifiers
     source          1..1804
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="21"
     CDS             1..1557
                     /codon_start=1
                     /product="aspartyl protease 1"
                     /protein_id="AAF17078.1"
                     /db_xref="GI:6561812"
                     /translation="MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVA
                     PTPGPGTPAERHADGLALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPP
                     QKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGE
                     DLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSL
                     VTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI
                     EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSD
                     GFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYE
                     CYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCAEIAGAAVSEISGPFST
                     EDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVVND
                     ESSLVRHRWK"
ORIGIN



Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy       1801 AAAA 1804
              ||||
Db       1801 AAAA 1804



RESULT 7 
AF178532 
LOCUS       AF178532                2990 bp    mRNA    linear   PRI 21-SEP-2000
DEFINITION  Homo sapiens aspartyl protease (BACE2) mRNA, complete cds.
ACCESSION   AF178532
VERSION     AF178532.1  GI:6851265
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2990)
  AUTHORS   Solans,A., Estivill,X. and de La Luna,S.
  TITLE     A new aspartyl protease on 21q22.3, BACE2, is highly similar to
            Alzheimer's amyloid precursor protein beta-secretase
  JOURNAL   Cytogenet. Cell Genet. 89 (3-4), 177-184 (2000)
  MEDLINE   20422477
   PUBMED   10965118
REFERENCE   2  (bases 1 to 2990)
  AUTHORS   Solans,A., Estivill,X. and de la Luna,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-AUG-1999) Medical and Molecular Genetics Center, IRO,
            Avia. Castelldefels Km 2,7, L'Hospitalet de Llobregat, Barcelona
            08907, Spain
FEATURES             Location/Qualifiers
     source          1..2990
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="21"
                     /map="21q22.3"
                     /clone="YAC 552A3"
     gene            1..2990
                     /gene="BACE2"
     CDS             464..2020
                     /gene="BACE2"
                     /codon_start=1
                     /product="aspartyl protease"
                     /protein_id="AAF29494.1"
                     /db_xref="GI:6851266"
                     /translation="MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVA
                     PTPGPGTPAERHADGLALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPP
                     QKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGE
                     DLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSL
                     VTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI
                     EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSD
                     GFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYE
                     CYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCAEIAGAAVSEISGPFST
                     EDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVVND
                     ESSLVRHRWK"
     polyA_signal    2225..2231
                     /gene="BACE2"
     polyA_signal    2959..2964
                     /gene="BACE2"
ORIGIN



Query Match 99.3%; Score 1790.6; DB 9; Length 2990; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1793; Conservative 0; Mismatches 4; Indels 0; Gaps 0;
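
The percentages in the summary above can be reproduced from the raw counts. The following is a minimal sketch, assuming that Query Match is the score expressed as a percentage of the perfect score listed in the report header (1804) and that Best Local Similarity is the number of matches divided by the number of aligned columns. Both definitions are inferred from the numbers in this report rather than stated by it, and the function names are hypothetical.

```python
def percent_of_perfect(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect score, to one decimal place.

    Assumed definition of "Query Match"; inferred, not stated in the report.
    """
    return round(100.0 * score / perfect_score, 1)


def local_similarity(matches: int, mismatches: int, indels: int = 0) -> float:
    """Matches as a percentage of aligned columns, to one decimal place.

    Assumed definition of "Best Local Similarity"; inferred, not stated.
    """
    aligned_columns = matches + mismatches + indels
    return round(100.0 * matches / aligned_columns, 1)


# Values taken from the RESULT 7 summary lines.
query_match = percent_of_perfect(score=1790.6, perfect_score=1804)
best_local = local_similarity(matches=1793, mismatches=4, indels=0)

print(f"Query Match {query_match}%; Best Local Similarity {best_local}%")
```

Under these assumptions the 1797 aligned columns (1793 matches plus 4 mismatches) give the reported 99.8%, and 1790.6 out of 1804 gives the reported 99.3%.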



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        464 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 523

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        524 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 583

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        584 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 643

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        644 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 703

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        704 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 763

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        764 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 823

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        824 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 883

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        884 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 943

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        944 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 1003

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1004 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 1063

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1064 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 1123

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1124 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 1183

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1184 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 1243

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1244 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 1303

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1304 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 1363

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1364 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1423

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1424 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1483

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1484 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1543

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db       1544 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1603

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1604 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1663

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1664 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1723

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              |||||||||||||||||||||||||||||||||||||||||||||||||| |||||||||
Db       1724 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCGGGTGCTGCA 1783

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1784 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1843

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1844 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1903

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1904 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1963

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1964 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 2023

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2024 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 2083

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2084 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 2143

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       2144 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 2203

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAA 1797
              ||||||||||||||||||||||||||||||||||||||||||||||||||||| | |
Db       2204 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAACAGA 2260



RESULT 8 
AF200192 
LOCUS       AF200192                1885 bp    mRNA    linear   PRI 16-FEB-2000
DEFINITION  Homo sapiens memapsin 1 mRNA, complete cds.
ACCESSION   AF200192
VERSION     AF200192.1  GI:6470290
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1885)
  AUTHORS   Lin,X., Koelsch,G., Wu,S., Downs,D., Dashti,A. and Tang,J.
  TITLE     Human aspartic protease memapsin 2 cleaves the beta-secretase site
            of beta-amyloid precursor protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 97 (4), 1456-1460 (2000)
  MEDLINE   20144060
   PUBMED   10677483
REFERENCE   2  (bases 1 to 1885)
  AUTHORS   Lin,X., Koelsch,G. and Tang,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-OCT-1999) Protein Studies Program, Oklahoma Medical
            Research Foundation, 825 N.E. 13th Street, Oklahoma City, OK 73104
            USA
FEATURES             Location/Qualifiers
     source          1..1885
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     CDS             78..1634
                     /function="cleaves beta-secretase site of beta-amyloid
                     precursor protein"
                     /note="membrane-associated aspartic protease 1"
                     /codon_start=1
                     /product="memapsin 1"
                     /protein_id="AAF13714.1"
                     /db_xref="GI:6470291"
                     /translation="MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVTAATNRVVA
                     PTPGPGTPAERHADGLALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPP
                     QKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGE
                     DLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSL
                     VTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI
                     EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSD
                     GFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYE
                     CYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCAEIAGAAVSEISGPFST
                     EDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVVND
                     ESSLVRHRWK"
ORIGIN



Query Match 99.1%; Score 1788.2; DB 9; Length 1885; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1801; Conservative 0; Mismatches 3; Indels 1; Gaps 1; 



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     78 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 137

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||
Db    138 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGACCGCGGCCACGAAC 197

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    198 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 257

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    258 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 317

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    318 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 377

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    378 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 437

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    438 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 497

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    498 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 557

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    558 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 617

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    618 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 677

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    678 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 737

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    738 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 797



Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    798 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 857

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    858 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 917

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    918 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 977

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    978 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1037

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1038 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1097

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1098 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1157

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db   1158 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1217

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1218 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1277

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1278 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1337

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1338 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1397

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1398 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1457

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1458 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1517

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1518 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1577

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1578 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1637

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1638 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1697

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1698 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1757

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1758 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1817

Qy   1741 CTCCCTACTTCCAAGAAAAATAATT-AAAAAAAAAACTTCATTCTAAACCAAAAAAAAAA 1799
          |||||||||||||||||||| |||| ||||||||||||||||||||||||||||||||||
Db   1818 CTCCCTACTTCCAAGAAAAACAATTAAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAA 1877

Qy   1800 AAAAA 1804
          |||||
Db   1878 AAAAA 1882
LOCUS       AX376004                1879 bp    DNA     linear   PAT 01-MAR-2002
DEFINITION  Sequence 71 from Patent WO0168848.
ACCESSION   AX376004
VERSION     AX376004.1  GI:19170395
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Baker,K.P., Chen,J., Desnoyers,L., Goddard,A., Godowski,P.J.,
            Gurney,A.L., Pan,J., Smith,V., Watanabe,C.K., Wood,W.I. and
            Zhang,Z.
  TITLE     Secreted and transmembrane polypeptides and nucleic acids encoding
            the same
  JOURNAL   Patent: WO 0168848-A 71 20-SEP-2001;
            Genentech, Inc. (US)
FEATURES             Location/Qualifiers
     source          1..1879
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
ORIGIN

Query Match 98.9%; Score 1784.4; DB 6; Length 1879;

Best Local Similarity 99.9%; Pred. No. 0;

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db   1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
          ||||||||||||||||||||||||||||||||||||||||||||||
Db   1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879
LOCUS       AY358927                1879 bp    mRNA    linear   PRI 03-OCT-2003
DEFINITION  Homo sapiens clone DNA45493 BACE2 (UNQ418) mRNA, complete cds.
ACCESSION   AY358927
VERSION     AY358927.1  GI:37182971
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1879)
  AUTHORS   Clark,H.F., Gurney,A.L., Abaya,E., Baker,K., Baldwin,D., Brush,J.,
            Chen,J., Chow,B., Chui,C., Crowley,C., Currell,B., Deuel,B.,
            Dowd,P., Eaton,D., Foster,J., Grimaldi,C., Gu,Q., Hass,P.E.,
            Heldens,S., Huang,A., Kim,H.S., Klimowski,L., Jin,Y., Johnson,S.,
            Lee,J., Lewis,L., Liao,D., Mark,M., Robbie,E., Sanchez,C.,
            Schoenfeld,J., Seshagiri,S., Simmons,L., Singh,J., Smith,V.,
            Stinson,J., Vagts,A., Vandlen,R., Watanabe,C., Wieand,D., Woods,K.,
            Xie,M.H., Yansura,D., Yi,S., Yu,G., Yuan,J., Zhang,M., Zhang,Z.,
            Goddard,A., Wood,W.I. and Godowski,P.
  TITLE     The Secreted Protein Discovery Initiative (SPDI), a Large-Scale
            Effort to Identify Novel Human Secreted and Transmembrane Proteins:
            A Bioinformatics Assessment
  JOURNAL   Genome Res. 13 (10), 2265-2270 (2003)
   PUBMED   12975309
REFERENCE   2  (bases 1 to 1879)
  AUTHORS   Clark,H.F.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-AUG-2003) Department of Bioinformatics, Genentech,
            Inc., 1 DNA Way, South San Francisco, CA 94080, USA
FEATURES             Location/Qualifiers
     source          1..1879
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="DNA45493"
     gene            1..1879
                     /locus_tag="UNQ418"
     CDS             94..1650
                     /locus_tag="UNQ418"
                     /note="PR0852"
                     /codon_start=1
                     /product="BACE2"
                     /protein_id="AAQ89286.1"
                     /db_xref="GI:37182972"
                     /translation="MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVA
                     PTPGPGTPAERHADGLALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPP
                     QKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGE
                     DLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSL
                     VTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI
                     EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSD
                     GFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYE
                     CYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCAEIAGAAVSEISGPFST
                     EDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVVND
                     ESSLVRHRWK"

Query Match 98.9%; Score 1784.4; DB 9; Length 1879;

Best Local Similarity 99.9%; Pred. No. 0;

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db   1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
          ||||||||||||||||||||||||||||||||||||||||||||||
Db   1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879
LOCUS       BC014453                1864 bp    mRNA    linear   PRI 04-OCT-2003
DEFINITION  Homo sapiens beta-site APP-cleaving enzyme 2, transcript variant a,
            mRNA (cDNA clone MGC:23029 IMAGE:4868925), complete cds.
ACCESSION   BC014453
VERSION     BC014453.1  GI:15680203
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1864)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
  MEDLINE   22388257
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1864)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
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source 



gene 



CDS 



misc feature 



Contact: MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC/DCTD/DTP 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium { LLNL) 
DNA Sequencing by: Genome Sequence Centre, 
BC Cancer Agency, Vancouver, BC, Canada 
inf o@bcgs c . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, ' 
George Yang, Scott Zuyderduyn, Marco Marra. 

Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 34 Row: 1 Column: 22 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 21040358. 

Location/Qualifiers 

1. .1864 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="MGC: 23029 IMAGE : 4868925" 

/tissue_type="Skin, melanotic melanoma, high MDR. " 

/clone_lib="NIH_MGC_4 9" 

/lab_host="DH10B-R" _ 

/note= "Vector : pOTB7" 

1. .1864 

/gene="BACE2" 

/note="synonyms: DRAP, ALP56, AEPLC, CEAP1, ASP1, ASP21 
BAE2, CDA13 " 

/db_xref= "Locus ID: 25825" 
/db__xref="MIM: 605668" 
92. .1648 
/ codon_start=l 

/product="beta-site APP-cleaving enzyme 2, isoform A 
preproprotein" 

/protein__id="AAHl4453 . 1" 
/db_xref="GI: 15680204" 
/db_xref= "Locus ID: 25 82 5" 

/ trans la tion="MGALARALLLPLLAQWLLRAAPELAPAP FTLPLRVAAATNRWA 
PTPGPGT PAERHADGLALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPP 
QKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGE 
DLVTI PKGFNTSFLVNIATI FESENFFLPGI KWNGILGLAYATLAKPSSSLETFFDSL 
VTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI 
EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSD 
GFWTGSQLACWTNSETPWSYFPKI S I YLRDENS SRSFRITI LPQLYIQPMMGAGLNYE 
CYRFGISPSTNALVI GATVMEGFYVI FDRAQKRVGFAASPCAEIAGAAVSEI SGPFST 

EDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEWND 
ESS LVRHRWK " 

362. .1381 



/note-"asp; Region: Eukaryotic aspartyl protease. Aspartyl 



(acid) proteases include pepsins, cathepsins, and renins. 
Two-domain structure, probably arising from ancestral 
duplication. This family does not include the retroviral 
nor retrotransposon proteases (pfam00077), which are much 
smaller and appear to be homologous to a single domain of 
the eukaryotic asp proteases" 
/db_xref= M CDD:pf am0002 6" 

ORIGIN 

Query Match 98.0%; Score 1768.2; DB 9; Length 1864; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1770; Conservative 0; Mismatches 3; Indels 0; Gaps 0 



Qy     1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    92 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 151

Qy    61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   152 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 211

Qy   121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   212 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 271

Qy   181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   272 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 331

Qy   241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   332 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 391

Qy   301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   392 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 451

Qy   361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   452 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 511

Qy   421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   512 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 571

Qy   481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   572 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 631

Qy   541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   632 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 691

Qy   601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   692 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 751

Qy   661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   752 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 811

Qy   721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   812 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 871

Qy   781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   872 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 931

Qy   841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   932 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 991

Qy   901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   992 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1051

Qy   961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1052 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1111

Qy  1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1112 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1171

Qy  1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1172 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1231

Qy  1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1232 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1291

Qy  1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1292 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1351

Qy  1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1352 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1411

Qy  1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1412 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1471

Qy  1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1472 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1531

Qy  1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1532 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1591

Qy  1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1592 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1651

Qy  1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1652 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1711

Qy  1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1712 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1771

Qy  1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1772 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1831

Qy  1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAA 1773
         |||||||||||||||||||| ||  ||||||||
Db  1832 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAAA 1864



RESULT 12
BD231786
LOCUS       BD231786                1873 bp    DNA     linear   PAT 17-JUL-2003
DEFINITION  Metastatic breast and colon cancer regulated genes
ACCESSION   BD231786
VERSION     BD231786.1  GI:33041556
KEYWORDS    JP 2002513542-A/18.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1873)
  AUTHORS   Xin,H. and Giese,K.
  TITLE     Metastatic breast and colon cancer regulated genes
  JOURNAL   Patent: JP 2002513542-A 18 14-MAY-2002;
            CHIRON CORP
COMMENT     OS   Homo sapiens (human)
            PN   JP 2002513542-A/18
            PD   14-MAY-2002
            PF   24-DEC-1998 JP 2000526659
            PR   31-DEC-1997 US 60/070112
            PI   HONG XIN, KLAUSE GIESE
            PC   C12N15/00,C07K14/47,C07K16/18,C07K19/00,C12N9/64,C12N15/09
            PC   C12Q1/68,
            PC   G01N33/50//A61K45/00,A61P35/04,C12N15/00,C12N15/00
            CC   Metastatic breast and colon cancer regulated genes
            FH   Key             Location/Qualifiers
            FT   source          1..1873
            FT                   /organism='Homo sapiens (human)'.
FEATURES             Location/Qualifiers
     source          1..1873
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
ORIGIN

  Query Match 98.0%; Score 1768.2; DB 6; Length 1873;
  Best Local Similarity 99.8%; Pred. No. 0;
  Matches 1770; Conservative 0; Mismatches 3; Indels 0; Gaps 0;



Qy     1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   101 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 160

Qy    61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   161 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 220

Qy   121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   221 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 280

Qy   181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   281 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 340

Qy   241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   341 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 400

Qy   301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   401 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 460

Qy   361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   461 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 520

Qy   421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   521 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 580

Qy   481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   581 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 640

Qy   541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   641 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 700

Qy   601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   701 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 760

Qy   661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   761 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 820

Qy   721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   821 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 880

Qy   781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   881 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 940

Qy   841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   941 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 1000

Qy   901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1001 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1060

Qy   961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1061 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1120

Qy  1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1121 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1180

Qy  1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1181 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1240

Qy  1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1241 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1300

Qy  1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1301 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1360

Qy  1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1361 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1420

Qy  1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1421 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1480

Qy  1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1481 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1540

Qy  1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1541 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1600

Qy  1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1601 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1660

Qy  1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1661 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1720

Qy  1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1721 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1780

Qy  1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1781 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1840

Qy  1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAA 1773
         |||||||||||||||||||| ||  ||||||||
Db  1841 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAAA 1873

RESULT 13 
AR411231 

LOCUS AR411231 1873 bp DNA linear PAT 18-DEC-2003

DEFINITION Sequence 18 from patent US 6635748. 

ACCESSION AR411231 

VERSION AR411231.1 GI:40163285 
KEYWORDS 

SOURCE Unknown.

ORGANISM Unknown. 

Unclassified. 

REFERENCE 1 (bases 1 to 1873) 

AUTHORS Xin,H. and Giese,K. 

TITLE Metastatic breast and colon cancer regulated genes 

JOURNAL Patent: US 6635748-A 18 21-OCT-2003; 
FEATURES Location/Qualifiers 
source 1. .1873 

/organism="unknown"
/mol_type="genomic DNA"

ORIGIN 

Query Match 98.0%; Score 1768.2; DB 6; Length 1873; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1770; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
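
The summary percentages above can be cross-checked with a little arithmetic. The sketch below assumes (this is inferred from the numbers, not stated in the report) that Query Match is the hit score divided by the perfect score of 1804 given in the report header, and that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + mismatches + indels). The function name `percent` is illustrative only.

```python
# Cross-check of the summary statistics for this hit.
# Assumed formulas (inferred, not documented in the report itself):
#   Query Match           = Score / Perfect score
#   Best Local Similarity = Matches / (Matches + Mismatches + Indels)

def percent(numerator: float, denominator: float) -> float:
    """Return numerator/denominator as a percentage rounded to one decimal."""
    return round(100.0 * numerator / denominator, 1)

PERFECT_SCORE = 1804          # "Perfect score" from the report header
score = 1768.2                # "Score" for this hit
matches, mismatches, indels = 1770, 3, 0

query_match = percent(score, PERFECT_SCORE)
best_local_similarity = percent(matches, matches + mismatches + indels)

print(f"Query Match: {query_match}%")
print(f"Best Local Similarity: {best_local_similarity}%")
```

Under these assumptions both values agree with the figures printed in the report (98.0% and 99.8%), and the aligned length of 1773 columns matches the query coordinates 1 to 1773 shown in the alignment.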

Qy     1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   101 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 160

Qy    61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   161 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 220

Qy   121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   221 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 280

Qy   181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   281 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 340

Qy   241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   341 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 400

Qy   301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   401 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 460

Qy   361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   461 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 520

Qy   421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   521 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 580

Qy   481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   581 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 640

Qy   541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   641 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 700

Qy   601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   701 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 760

Qy   661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   761 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 820

Qy   721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   821 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 880

Qy   781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   881 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 940

Qy   841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   941 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 1000

Qy   901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1001 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1060

Qy   961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1061 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1120

Qy  1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1121 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1180

Qy  1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1181 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1240

Qy  1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1241 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1300

Qy  1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1301 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1360

Qy  1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1361 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1420

Qy  1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1421 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1480

Qy  1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1481 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1540

Qy  1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1541 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1600

Qy  1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1601 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1660

Qy  1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1661 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1720

Qy  1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1721 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1780

Qy  1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1781 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1840

Qy  1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAA 1773
         |||||||||||||||||||| ||  ||||||||
Db  1841 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAAA 1873



RESULT 14
AF117892
LOCUS       AF117892                1873 bp    mRNA    linear   PRI 14-JUL-2000
DEFINITION  Homo sapiens aspartic-like protease mRNA, complete cds
ACCESSION   AF117892
VERSION     AF117892.1  GI:5565865
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo
REFERENCE   1 (bases 1 to 1873)
  AUTHORS   Xin,H., Stephans,J.C., Duan,X., Harrowe,G., Kim,E., Grieshammer,U.,
            Kingsley,C. and Giese,K.
  TITLE     Identification of a novel aspartic-like protease differentially
            expressed in human breast cancer cell lines
  JOURNAL   Biochim. Biophys. Acta 1501 (2-3), 125-137 (2000)
  MEDLINE   20298348
   PUBMED   10838186
REFERENCE   2 (bases 1 to 1873)
  AUTHORS   Xin,H., Stephans,J.C., Duan,X., Harrowe,G., Kim,E., Grieshammer,U.
            and Giese,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JAN-1999) Technologies, Chiron Co., 4560 Horton St
            4.5103, Emeryville, CA 94608, USA
FEATURES             Location/Qualifiers
     source          1..1873
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /cell_type="stroma"
                     /tissue_type="bone marrow"
     CDS             101..1657
                     /codon_start=1
                     /product="aspartic-like protease"
                     /protein_id="AAD45240.1"
                     /db_xref="GI:5565866"
                     /translation="MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVA
                     PTPGPGTPAERHADGLALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPP
                     QKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGE
                     DLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSL
                     VTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI
                     EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSD
                     GFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYE
                     CYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCAEIAGAAVSEISGPFST
                     EDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVVND
                     ESSLVRHRWK"
ORIGIN



Query Match 98.0%; Score 1768.2; DB 9; Length 1873; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1770; Conservative 0; Mismatches 3; Indels 0; Gaps 0;



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  101 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 160

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  161 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 220

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  221 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 280

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  281 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 340

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  341 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 400



Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  401 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 460



Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  461 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 520

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  521 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 580

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  581 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 640

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  641 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 700

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  701 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 760

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  761 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 820

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  821 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 880

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  881 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 940



Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  941 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 1000

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1001 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1060

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1061 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1120

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1121 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1180

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1181 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1240

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1241 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1300

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1301 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1360

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1361 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1420

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1421 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1480

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1481 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1540

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1541 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1600

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1601 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1660

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1661 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1720

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1721 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1780

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1781 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1840



Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAA 1773
        |||||||||||||||||||| ||  ||||||||
Db 1841 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAAA 1873
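
The summary figures printed for RESULT 14 can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only. It assumes that "Query Match" is the hit score divided by the perfect score of 1804 given in the report header, and that "Best Local Similarity" is the number of matches divided by the number of aligned columns. These definitions reproduce the printed values but are not taken from the search program's documentation, and the helper name `percent` is hypothetical.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)

# Figures copied from the report for RESULT 14 (AF117892).
PERFECT_SCORE = 1804                    # perfect score from the report header
score = 1768.2
matches, mismatches, indels = 1770, 3, 0

# Assumed definitions (not confirmed by the search program's documentation).
query_match = percent(score, PERFECT_SCORE)
aligned_columns = matches + mismatches + indels
best_local_similarity = percent(matches, aligned_columns)

print(query_match, best_local_similarity)   # prints: 98.0 99.8
```

Under these assumptions the three mismatches all fall in the final alignment block (query positions 1741 to 1773), which is why every earlier block aligns without a gap in the match line.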



RESULT 15
AR136909
LOCUS       AR136909                1862 bp    DNA     linear   PAT 16-JUN-2001
DEFINITION  Sequence 1 from patent US 6162630
ACCESSION   AR136909
VERSION     AR136909.1  GI:14478159
KEYWORDS
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1 (bases 1 to 1862)
  AUTHORS   Powell,D.J., Southan,C., Chapman,C.G. and Evans,J.R.
  TITLE     ASP1
  JOURNAL   Patent: US 6162630-A 1 19-DEC-2000;
FEATURES             Location/Qualifiers
     source          1..1862
                     /organism="unknown"
                     /mol_type="unassigned DNA"
ORIGIN



Query Match 97.9%; Score 1765.6; DB 6; Length 1862;

Best Local Similarity 99.8%; Pred. No. 0;

Matches 1768; Conservative 0; Mismatches 4; Indels 0; Gaps 0;



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   91 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 150

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  151 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 210

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  211 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 270

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  271 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 330

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  331 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 390

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  391 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 450

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  451 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 510

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  511 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 570



Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  571 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 630

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  631 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 690

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  691 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 750



Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  751 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 810

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  811 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 870

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  871 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 930

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  931 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 990

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  991 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1050



Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1051 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1110

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1111 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1170

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1171 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1230

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1231 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1290

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1291 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1350

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1351 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1410

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1411 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1470

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1471 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1530

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1531 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1590

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1591 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1650



Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1651 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1710

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1711 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1770

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1771 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1830

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAA 1772
        |||||||||||||||||||| ||  |||||||
Db 1831 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAA 1862
Scoring table: IDENTITY_NUC
Searched: 3373863 seqs, 2124099041 residues
Minimum DB seq length: 0
ALIGNMENTS



RESULT 1 
AAA15661 

ID AAA15661 standard; cDNA; 1804 BP. 
XX 

AC AAA15661; 
XX 

DT 03-AUG-2000 (first entry) 
XX 

DE Human aspartyl protease 1 (Asp1) nucleotide sequence.
XX

KW Aspartyl protease; aspartase; amyloid precursor protein; APP; Asp1;

KW Alzheimer's disease; beta secretase site; ss.

XX 

OS Homo sapiens. 



XX 

PN WO200017369-A2.
XX

PD 30-MAR-2000.
XX

PF 23-SEP-1999; 99WO-US020881.
XX

PR 24-SEP-1998; 98US-0101594P.
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney ME, Bienkowski MJ, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2000-303209/26. 

DR P-PSDB; AAY88424. 
XX 

PT New enzyme designated human aspartase useful in research into Alzheimer's
PT Disease is capable of cleaving amyloid protein precursor at the beta
PT secretase site to produce amyloid beta peptide.
XX 

PS Claim 11; Fig 1; 183pp; English. 
XX 



CC This sequence represents the human aspartyl protease nucleotide sequence. 

CC The invention relates to a protease capable of cleaving the beta 

CC secretase site of amyloid precursor protein (APP). The protease contains

CC a sequence encoding the amino acid sequence DTG and a sequence encoding 

CC DSG or DTG separated by 100-300 amino acids. When mutated the APP gene 

CC causes an autosomal dominant form of Alzheimer's disease. APP localises 

CC to the cell surface membrane and have a single C-terminal transmembrane 

CC domain. Proteolytic processing of APP produces the amyloid beta protein, 

CC which is possibly very important in Alzheimer's disease. The invention 

CC includes a nucleotide sequence encoding the protease, a vector containing 

CC the nucleotide sequence, and a cell line comprising the vector. Methods 

CC for screening for inhibitors of beta secretase activity are also given in 

CC the invention. The human aspartase protein and nucleotide sequences and 

CC the methods for identifying inhibitors of the protease, are useful in the 

CC treatment of and research in to Alzheimer's disease 
XX 

SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1804; DB 3; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180



Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020



Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804
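
The CC annotation of AAA15661 describes the protease as containing a DTG motif and a second DSG or DTG motif separated by 100-300 amino acids. A minimal sketch of how such a motif pair could be located in a protein sequence follows. The function name `find_aspartyl_motif_pairs` is hypothetical, and the sketch assumes that "separated by" means the number of residues lying between the end of the first motif and the start of the second, which the record itself does not define.

```python
import re

def find_aspartyl_motif_pairs(protein, min_gap=100, max_gap=300):
    """Return (first_pos, second_pos, gap) tuples, positions 1-based.

    first_pos  : start of a DTG motif
    second_pos : start of a later DSG or DTG motif
    gap        : residues between the two motifs (assumed definition)
    """
    first_starts = [m.start() for m in re.finditer("DTG", protein)]
    second_starts = [m.start() for m in re.finditer("D[ST]G", protein)]
    pairs = []
    for i in first_starts:
        for j in second_starts:
            gap = j - (i + 3)  # residues after the first motif, before the second
            if min_gap <= gap <= max_gap:
                pairs.append((i + 1, j + 1, gap))
    return pairs

# Synthetic example (not a real protein): DTG, 150 alanines, then DSG.
toy_protein = "M" + "DTG" + "A" * 150 + "DSG" + "K"
print(find_aspartyl_motif_pairs(toy_protein))   # prints: [(2, 155, 150)]
```

The same function can be applied to any translated sequence in the report, for example the /translation qualifier of a GenBank hit, after removing whitespace and line breaks from the residue string.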



RESULT 2 
AAS11701 



ID AAS11701 standard; DNA; 1804 BP. 
XX 

AC AAS11701; 
XX 

DT 24-OCT-2001 (first entry) 
XX 

DE DNA encoding human aspartyl protease 1 (Asp-1).
XX 

KW Human; aspartyl protease 1; Asp-1; nootropic; neuroprotective;
KW aspartyl protease 2; Asp2; amyloid protein precursor; APP;
KW beta-secretase; Alzheimer's disease; ds.
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers
FT CDS 1..1557
FT /*tag= a
FT /product= "Aspartyl protease-1 (Asp-1)"

XX 

PN WO200149097-A2.
XX

PD 12-JUL-2001.
XX

PF 09-MAY-2001; 2001WO-IB000797.
XX

PR 09-MAY-2001; 2001WO-IB000797.
XX 

PA (BIEN/) BIENKOWSKI M J. 

PA (GURN/) GURNEY M E. 

PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 

PA (YANR/) YAN R. 

XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-502548/55. 

DR P-PSDB; AAU07201. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity. 

XX 

PS Example 2; Fig 1; 185pp; English. 
XX 

CC The invention relates to a novel purified polypeptide comprising a 

CC fragment of mammalian aspartyl protease 2 (Asp2) protein which lacks the 

CC Asp2 transmembrane domain and the Asp2 protein, and where the polypeptide 

CC and the fragment retain the beta-secretase activity of the mammalian Asp2 

CC protein. Also included is an isoform of amyloid protein precursor (APP) 

CC comprising the amino acid sequence of a APP or its fragment containing an 

CC APP cleavage site recognisable by a mammalian beta-secretase, and further 

CC comprising two lysine residues at the carboxyl terminus of the amino acid 

CC sequence of the mammalian APP or APP fragment. The polypeptides are used 

CC for assaying for modulators of beta-secretase activity; identifying 

CC agents that inhibit the APP processing activity of human Asp2 aspartyl 

CC protease (Hu-Asp2); identifying agents that modulate the activity of Asp2

CC ; and for reducing cellular production of amyloid beta (Abeta) from APP.

CC Agents identified by the above methods are useful for treating 

CC Alzheimer's disease; and for identifying modulators of amyloid-beta 

CC (Abeta) peptide production, for use in designing therapeutics for the 

CC treatment or prevention of Alzheimer's disease. Probes and primers 

CC derived from Asp nucleic acid sequences are useful for detecting Hu-Asp 

CC nucleic acids in in vitro assays and in Northern and Southern blots. The 

CC present sequence represents the coding sequence of human Asp-1 

XX 

SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420


Qy 


421 


AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 

1 1 1 1 1 I 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 I 1 1 ! 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I 1 1 1 | M 1 M 1 1 1 1 

AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 


480 


Db 


421 


480 


Qy 


481 


GACC T C GT C AC CAT C C C CAAAGG CTT CAAT AC TTCTTTTCTT GT CAAC AT T GC CAC TAT T 

1 1 1 1 1 f N 1 II 1 1 1 1 1 1 1 I I I I | I | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | | | | | | | | | | | | | 

GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 


540 


Db 


481 


540 


Qy 


541 


TT T GAAT CAGAGAAT TTCTTTTTGCCT G GGAT TAAAT GGAAT GGAAT ACT T GGC C T AGCT 

1 1 1 ' 1 1 1 1 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | | | | | 

TT T GAAT CAGAGAAT TTCTTTTTGCCT G GGAT TAAAT GGAAT GGAAT ACT T GGC C T AGC T 


600 


Db 


541 


600 



Qy 



601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660 
I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 



601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660 

661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 72 0 

I I I f I I I I I I M I I I I i M I I I I I I I I I I M I I I I I II I I I I | I | | | | | M I II | | I M I 
661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 72 0 

721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 7 80 

N M I II I I I I I I I I I I I | | | | | | | | | | | | | | | | ( | | | | | M II I | | | | | | | | | | | M | | 
721 GGAT C T G G GAC CAAC G GAGGT AGT CTTGTCTT GG GT G GAAT T GAAC CAAGT T T GT ATAAA 7 80 

7 81 GGAGACAT CT G GT AT AC C C CT AT T AAG GAAGAGT G GTACT AC CAGAT AGAAAT T C T GAAA 84 0 

I I I I N I I I I I I I I I I I I I I I I i I I I | | | | | | M II I I I I I I I I I I I | | | | | | | I I I I M 
7 81 GGAGACAT CTGGTATAC CC CTATTAAGGAAGAGT GGTACT AC CAGAT AGAAAT T CT GAAA 840 

841 TT G GAAAT T GGAGGC CAAAGC C TT AAT C T GGAC T G C AGAG AGTATAAC GC AGACAAGG C C 900 

N I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

841 T T GGAAAT T GGAGG C CAAAGC CT TAAT C T G GACT GC AGAG AGTATAAC G C AGACAAG G C C 900 

901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGT GTTTGATGCGGTGGTG 960 

I I N I I I I I I I I I I I I I | | | M I I I I I I I I If I I I I I I I I I I I I | | I I I I I I I I I I | I I I 
9 01 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 9 60 

961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020 

N I I I I I I I I I I I I I | M I I I I I I I I I I I I I I M I I I I I I I I I I I I | | | | M I I I I I I I | 
961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020 

1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 108 0 

I I I I I I I I I I I I I I I I I I I | | | | | | | | | [I I I I I I I | | | | | | I I I I I I I I I I I I I I I I | I 
1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 10 8 0 

1081 T AC CT GAGAGAT GAGAAC T C CAGCAG GT CAT T C C GT AT CAC AAT C CT GC CT C AG CT T T AC 114 0 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I | I I I I I I I I I I I I I I I I I I I | | | | | | | | | 
1081 T AC C T GAGAGAT GAGAAC T C CAGC AGGT CAT T C C GT AT CAC AAT C CT GC C T C AG C T TT AC 114 0 

1141 ATT CAGCCCAT GAT GGGGGCCGGCCTGAATTAT GAAT GTTACCGATTCGGCATTTCCCCA 1200 

M f I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200 

12 01 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 12 60 

N I I I I I I I I I I M I I I I I I I I I I | | | | | | | I I I I I I I I I I M I | | I I I I I M I I I I I I I 
12 01 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 12 60 

12 61 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320 

' I I I I I I I I I I I I I I I I | | | | | | i I I I I I I I I I I I I M I I I I M I I I I I I I I I f I I I I I I 
1261 AGAG C C C AGAAGAGG GT GG GC T T C GCAG C GAG C CC C T GT GC AGAAAT T GC AGGT G C T G C A 1320 

1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 13 8 0 

I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I | I | | | | I I I I I I I I I 
1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 13 8 0 

1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 14 4 0 

N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I | | | | | | | | | M | | | | 
1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 14 4 0 

14 41 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 150 0 

N I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I II I | | | | | | | | | | 
1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500 



Qy 


1501 


C GT GAC C C T GAGGT C GT CAAT GAT GAGT C CT C T CT GGT C AGACAT C G CT G GAAAT GAAT A 

1 1 1 1 I 1 1 1 1 1 M 1 1 1 1 1 1 1 I I I ! | | | | | | If 1 1 I 1 I II I 1 1 1 1 M 1 1 1 1 I 1 1 1 I I I | I I I 

C GT GAC C CT GAG GT C GT CAAT GAT GAGT C CT CT CT G GT C AGACAT C GC T G GAAAT GAAT A 


1560 


Db 


1501 


1560 


Qy 


1561 


G C C AGG C C T GAC CT CAAG CAAC CAT GAAC T C AGCT AT TAAGAAAAT C AC AT T T C CAG GG C 

' 1 1 1 1 1 1 1 1 i 1 1 1 1 1 II 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 | | | | | | | 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 | | 1 1 
GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 


1620 


Db 


1561 


1620 


Qy 


1621 


AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 
f 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 | | M 1 1 1 1 1 1 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 | | I | 
AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 


1680 


Db 


1621 


1680 


Qy 


1681 


GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 

1 1 N 1 1 > 1 M 1 1 I M 1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 I 1 1 1 1 
GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 


1740 


Db 


1681 


1 74H 
J_ / *± u 


Qy 


1741 


CT C C C T ACT T C C AAGAAAAAT AAT T AAAAAAAAAAC T T CAT T C T AAAC C AAAAAAAAAAA 

1 ' 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CT C C CT AC T T C CAAGAAAAATAAT T AAAAAAAAAAC T T CAT T CTAAAC CAAAAAAAAAAA 


1800 


Db 


1741 


1800 


Qy 


1801 


AAAA 1804 
1 1 1 1 

AAAA 1804 




Db 


1801 





RESULT 3 
AAD17864 

ID AAD17864 standard; cDNA; 1804 BP.
XX
AC AAD17864;
XX
DT 10-DEC-2001 (first entry)
XX
DE Human aspartyl protease 1 (hu-Asp1) cDNA.
XX
KW Human; aspartyl protease 1; Asp1; amyloid precursor protein; APP;
KW Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis;
KW amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective;
KW chromosome 21; ss.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT CDS 1..1557
FT /*tag= a
FT /product= "Human aspartyl protease 1"
FT sig_peptide 1..60
FT /*tag= b
FT mat_peptide 61..1554
FT /*tag= c
FT /product= "Mature human aspartyl protease 1"
XX
PN GB2357767-A.
XX
PD 04-JUL-2001.
XX
PF 22-SEP-2000; 2000GB-00023315.
XX
PR 23-SEP-1999; 99US-00404133.
PR 23-SEP-1999; 99US-0155493P.
PR 23-SEP-1999; 99WO-US020881.
PR 13-OCT-1999; 99US-00416901.
PR 06-DEC-1999; 99US-0169232P.
XX

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2001-444208/48. 

DR P-PSDB; AAE10628. 
XX 

PT Polypeptide comprising fragments of human aspartyl protease with amyloid
PT precursor protein processing activity and alpha-secretase activity, for
PT identifying modulators useful in treating Alzheimer's disease
XX
PS Claim 30; Fig 1; 187pp; English.
XX
CC The patent discloses human aspartyl protease 1 (hu-Asp1) or modified Asp1
CC proteins which lack transmembrane domain or amino terminal domain or
CC cytoplasmic domain and retains alpha-secretase activity and amyloid
CC protein precursor (APP) processing activity. The proteins of the
CC invention are useful for assaying hu-Asp1 alpha-secretase activity, which
CC in turn is useful for identifying modulators of hu-Asp1 alpha-secretase
CC activity, where modulators that increase hu-Asp1 alpha-secretase activity
CC are useful for treating Alzheimer's disease (AD) which causes progressive
CC dementia with consequent formation of amyloid plaques, neurofibrillary
CC tangles, gliosis and neuronal loss. Hu-Asp1 protease substrate is useful
CC for assaying hu-Asp1 proteolytic activity, by contacting hu-Asp1 protein
CC with the substrate under acidic conditions and determining the level of
CC hu-Asp1 proteolytic activity. The present sequence is a cDNA encoding
CC human Asp1 protein. Asp1 gene is localised on chromosome 21
XX 

SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804
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Human aspartyl protease 1 (Hu-Aspl) cDNA. 

KW Human; aspartyl protease 1; Asp 1; beta-amyloid precursor protein; APP;
KW beta-secretase; Alzheimer's disease; dementia; amyloid plaque; gliosis;
KW neurofibrillary tangle; neuronal loss; amyloid-beta peptide; nootropic-
KW neuroprotective; antisense therapy; gene therapy; chromosome 21;
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XX 
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XX 
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XX 

OS Homo sapiens.
XX 

FH Key Location/Qualifiers 
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FT 
FT 
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FT /*tag= b 

FT mat_peptide 61..1554
FT /*tag= c
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/*tag= a 

FT /product= "Human aspartyl protease 1 (Hu-Asp1)"
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/product- "Mature human aspartyl protease 1 (Hu-Aspl)" 
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PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity. 

XX 

PS Example 2; Fig 1; 185pp; English. 
XX 



CC The invention relates to human aspartyl proteases (Hu-Asp), beta-amyloid
CC precursor protein (APP) isoforms and their corresponding DNA molecules.
CC Human aspartyl proteases can act as beta-secretase proteases useful for
CC treating Alzheimer's disease. APP isoforms are useful for identifying
CC modulators of amyloid-beta peptide production, for use in designing
CC therapeutics for the treatment and prevention of Alzheimer's disease,
CC dementia, formation of amyloid plaques, neurofibrillary tangles, gliosis
CC and neuronal loss. APP isoforms are also used in methods for identifying
CC inhibitors and modulators of human Asp2 activity. The invention relates
CC to a method for identifying agents that modulate the activity of human
CC aspartyl protease Asp2. Amyloid-beta peptides obtained from APP are used
CC as a means to screen in cellular assays for the inhibitors of beta- and
CC gamma-secretase. Hu-Asp DNA fragments are useful as probes or primers in
CC polymerase chain reactions (PCR). The probes are useful for detecting
CC Hu-Asp nucleic acids in in vitro assays and in Northern and Southern blots.
CC The present cDNA sequence encodes human aspartyl protease 1 (Hu-Asp1).
CC Hu-Asp1 gene is localised on chromosome 21
XX 

SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other;

Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804





RESULT 5 
AAD06738 

ID AAD06738 standard; cDNA; 1804 BP.
XX
AC AAD06738;
XX
DT 10-AUG-2001 (first entry)
XX
DE Human aspartyl protease 1 (Asp 1) cDNA.
XX
KW Human; alpha-secretase; amyloid precursor protein; APP; therapy;
KW Alzheimer's disease; antialzheimer's; aspartyl protease 1; Asp 1;
KW beta-secretase; chromosome 21; ss.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT CDS 1..1557
FT /*tag= a
FT /product= "Human aspartyl protease 1 (Asp 1)"
FT sig_peptide 1..60
FT /*tag= b
FT sig_peptide 64..186
FT /*tag= c
FT /note= "Pre-pro-peptide"
FT sig_peptide 67..186
FT /*tag= d
FT /note= "Pro-peptide"
FT mat_peptide 187..1554
FT /*tag= e
FT /product= "Human mature aspartyl protease 1 (Asp 1)"
XX
PN WO200123533-A2.
XX
PD 05-APR-2001.
XX
PF 22-SEP-2000; 2000WO-US026080.
XX
PR 23-SEP-1999; 99US-0155493P.
PR 23-SEP-1999; 99WO-US020881.
PR 13-OCT-1999; 99US-00416901.
PR 06-DEC-1999; 99US-0169232P.
XX
PA (PHAA ) PHARMACIA & UPJOHN CO.
XX
PI Gurney M, Bienkowski MJ;
XX
DR WPI; 2001-290516/30.
DR P-PSDB; AAE02580.
XX
PT Enzymes that cleave the alpha-secretase site of the amyloid precursor
PT protein, useful for the treatment of Alzheimer's disease.
XX
PS Example 2; Fig 1; 189pp; English.
XX
CC The present invention relates to enzymes for cleaving the alpha-
CC secretase site of the amyloid precursor protein (APP) and methods of
CC identifying those enzymes. The methods may be used to identify enzymes
CC that may be used to cleave the alpha-secretase cleavage site of the APP
CC protein. The enzymes may be used to treat or modulate the progress of
CC Alzheimer's disease. The present sequence is human aspartyl protease 1
CC (Asp 1) cDNA. Asp 1 has alpha-secretase protease and beta-secretase
CC protease activities. Asp 1 gene is located on chromosome 21
XX 

SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;
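The SQ line above declares a length and a per-base composition, and the two can be cross-checked against each other. The following is a minimal sketch of that check, not part of the search program's output; `parse_sq_line` is a hypothetical helper name, and the regular expressions assume the `N BP; n A; n C; ...` layout seen in this report.

```python
import re

def parse_sq_line(line):
    """Return (declared_length, {base: count}) from an SQ summary line.

    Assumes the layout 'Sequence <len> BP; <n> A; <n> C; ... <n> Other;'.
    """
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    counts = {
        base: int(n)
        for n, base in re.findall(r"(\d+)\s+([ACGTU]|Other);", line)
    }
    return length, counts

sq = "SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other;"
length, counts = parse_sq_line(sq)

# The per-base counts should add up to the declared sequence length.
assert sum(counts.values()) == length
print(length, counts)
```

A mismatch between the summed counts and the declared length would suggest that a digit on the SQ line was damaged during text extraction.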

Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804
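The row of bars between each Qy and Db row marks identical positions, so it can be regenerated from the two sequence rows whenever it is damaged. The sketch below illustrates this, assuming that a bar means an exact character match and a space means anything else; `clean_row` and `match_line` are hypothetical helper names, not functions of the search program.

```python
def clean_row(text):
    """Drop whitespace scattered through a sequence row and upper-case it."""
    return "".join(text.split()).upper()

def match_line(qy, db):
    """Build the marker row: '|' where the rows agree, ' ' elsewhere."""
    if len(qy) != len(db):
        raise ValueError("rows must have equal length")
    return "".join("|" if q == d else " " for q, d in zip(qy, db))

# Block 1741..1800 of this report: one row as it appears with stray spaces,
# the other as a clean 60-residue row.
db_row = clean_row(
    "C T C C CT ACTT CC AAGAAAAATAAT T AAAAAAAAAACT T CAT T C TAAAC CAAAAAAAAAAA"
)
qy_row = "CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA"

markers = match_line(qy_row, db_row)
print(markers.count("|"), "of", len(markers), "positions identical")
```

Summing the bar counts over all blocks of a hit gives a figure that can be compared with the reported `Matches` value.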



RESULT 6
AAS11516

ID AAS11516 standard; cDNA; 1804 BP.
XX
AC AAS11516;
XX
DT 24-OCT-2001 (first entry)
XX
DE Human cDNA encoding Aspartyl protease 1 (Asp1).
XX
KW Human; Aspartyl protease; Asp1; Asp2; beta-secretase; nootropic;
KW neuroprotective; amyloid protein precursor; APP; Alzheimer's disease;
KW amyloid-beta; Abeta; ss.
XX
OS Homo sapiens.
XX
FH Key Location/Qualifiers
FT CDS 1..1557
FT /*tag= a
FT /product= "Asp1"
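The CDS location can be sanity-checked against the sequence: a coding region should span a whole number of codons and, if it includes the terminator, end on a stop codon. The sketch below is illustrative only; `parse_location` and `codon_count` are hypothetical helpers, the parser handles only simple `start..end` locations, and the 60-residue row is the 1501..1560 block copied from the alignments in this report.

```python
import re

def parse_location(text):
    """Parse a simple 'start..end' location (1-based, inclusive).

    Tolerates stray spaces around the dots, e.g. '1. .1557'.
    """
    m = re.fullmatch(r"\s*(\d+)\s*\.\s*\.\s*(\d+)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return int(m.group(1)), int(m.group(2))

def codon_count(start, end):
    """Number of codons in start..end; raises if not a multiple of three."""
    length = end - start + 1
    if length % 3:
        raise ValueError("feature length is not a multiple of 3")
    return length // 3

start, end = parse_location("1. .1557")
print(codon_count(start, end))  # 519

# Residues 1501..1560 as shown in the alignment rows.
ROW_START = 1501
row_1501 = "CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA"

# Last codon of the CDS (positions end-2 .. end), sliced out of that row.
last_codon = row_1501[end - 2 - ROW_START : end + 1 - ROW_START]
print(last_codon)  # TGA
```

Under these assumptions the feature covers 519 codons with TGA in the final position, which would correspond to 518 coding codons followed by a stop.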

XX
PN WO200149098-A2.
XX
PD 12-JUL-2001.
XX
PF 09-MAY-2001; 2001WO-IB000798.
XX
PR 09-MAY-2001; 2001WO-IB000798.
XX
PA (BIEN/) BIENKOWSKI M J.
PA (GURN/) GURNEY M E.
PA (HEIN/) HEINRIKSON R L.
PA (PARO/) PARODI L A.
PA (YANR/) YAN R.
XX
PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R;
XX
DR WPI; 2001-502549/55.
DR P-PSDB; AAU06602.
XX
PT Novel purified polypeptide comprising fragment of mammalian aspartyl
PT protease 2, lacking Asp2 transmembrane domain and retaining beta
PT secretase activity of Asp2 useful for identifying inhibitors of Asp2
PT activity.
XX
PS Example 2; Fig 1; 185pp; English.
XX
CC The invention relates to a purified polypeptide comprising a fragment of
CC mammalian aspartyl protease (Asp) 2 protein which lacks the Asp2
CC transmembrane domain and the Asp2 protein, and where the polypeptide and
CC the fragment retain the beta-secretase activity of the mammalian Asp2
CC protein. The invention also details polynucleotides for the Asp proteins
CC and vectors expressing them, and a polypeptide (isoform of amyloid
CC protein precursor (APP)) comprising the amino acid sequence of an APP or
CC its fragment containing an APP cleavage site recognizable by a mammalian
CC beta-secretase, and further comprising two lysine residues at the
CC carboxyl terminus of the amino acid sequence of the mammalian APP or APP
CC fragment. Also included in the invention are methods of identifying
CC modulators or inhibitors of Asp2. Modulators and inhibitors of Asp2 are
CC useful for treating Alzheimer's disease. APP is useful in methods for
CC identifying inhibitors or modulators of human Asp2 activity and amyloid-
CC beta (Abeta) peptide production. APP is also useful in designing
CC therapeutics for the treatment or prevention of Alzheimer's disease. APP
CC comprising the APP-Sw-beta-secretase peptide sequence (NLDA), which is
CC associated with increased levels of Abeta processing is useful in assays
CC relating the Alzheimer's research. The expression vector is useful for
CC recombinantly expressing APP. Nucleic acids that hybridise to Asp
CC oligonucleotides are useful as probes or primers. The probes are useful
CC for detecting Hu-Asp nucleic acids in in vitro assays and in Northern and
CC Southern blots. The present sequence encodes human Asp1
XX
SQ Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other;

Query Match 100.0%; Score 1804; DB 4; Length 1804;
Best Local Similarity 100.0%; Pred. No. 0;

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804
PS Claim 8; Fig 1; 182pp; English.
XX
CC The present invention describes a human aspartyl protease 1 (hu-Asp1)
CC substrate (I) which comprises a peptide of no more than 50 amino acids,
CC and which comprises the 8 amino acid sequence Gly-Leu-Ala-Leu-Ala-Leu-
CC Glu-Pro. Also described are: (1) a method (II) for assaying hu-Asp1
CC proteolytic activity, comprising: (a) contacting a hu-Asp1 protein with
CC (I) under acidic conditions; and (b) determining the level of hu-Asp1
CC proteolytic activity; (2) a purified polynucleotide (III) comprising a
CC nucleotide sequence that hybridises under stringent conditions to the
CC non-coding strand complementary to a defined 1804 nucleotide sequence (see
CC ABL52456) where the nucleotide sequence encodes a polypeptide having Asp1
CC proteolytic activity and lacks nucleotides encoding a transmembrane
CC domain); (3) a purified polynucleotide (III') comprising a sequence that
CC hybridises under stringent conditions to (III) (the nucleotide sequence
CC encodes a polypeptide further lacking a pro-peptide domain corresponding
CC to amino acids 23-62 of hu-Asp1 (see ABB78589)); (4) a vector (IV)
CC comprising (III) or (III'); and (5) a host cell (V) transformed or
CC transfected with (III), (III') and/or (IV). The hu-Asp1 protease
CC substrate (I) may be used as an enzyme substrate in assays to detect
CC aspartyl protease activity, (II) and therefore diagnose diseases
CC associated with aberrant hu-Asp1 expression and activity such as
CC Alzheimer's disease. Hu-Asp1 has been localised to chromosome 21, while
CC hu-Asp2 has been localised to chromosome 11q23.3-24.1. The present
CC sequence encodes hu-Asp1 from the present invention



Sequence 1804 BP; 397 A; 520 C; 458 G; 429 T; 0 U; 0 Other; 



Query Match 100.0%; Score 1804; DB 6; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804
DR P-PSDB; AAY41714.
XX
PT New secreted and transmembrane polypeptides and their polynucleotides,
PT useful for treating blood coagulation disorders, cancers and cellular
PT adhesion disorders.
XX
PS Claim 2; Fig 72; 530pp; English.
XX



CC The present invention describes secreted and transmembrane polypeptides
CC and their polynucleotides. The nucleotide sequences are useful as sources
CC of probes, primers, for chromosome mapping, and for generation of
CC antisense sequences. They can also be used to create transgenic animals.
CC The proteins can be used to treat a variety of diseases and disorders
CC depending on their function. Diseases that may be treated include blood
CC coagulation disorders, cancers and cellular adhesion disorders. They may
CC also be used to raise antibodies. AAZ33891 to AAZ34338, and AAY41685 to
CC AAY41774 represent polynucleotide and polypeptide sequence given in the
CC exemplification of the present invention



XX
SQ Sequence 1879 BP; 388 A; 559 C; 498 G; 434 T; 0 U; 0 Other;

Query Match 98.9%; Score 1784.4; DB 2; Length 1879;
Best Local Similarity 99.9%; Pred. No. 0;
Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;
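The two percentages in this summary can be reproduced from the other figures on the same lines. The sketch below assumes that Query Match is the hit score divided by the perfect score (1804, from the report header) and that Best Local Similarity is the number of matches divided by the number of aligned columns; both definitions are inferred from the numbers here rather than taken from the program's documentation, and `summary_percentages` is a hypothetical helper.

```python
def summary_percentages(score, perfect_score, matches, mismatches, indels=0):
    """Return (query_match_pct, best_local_similarity_pct, aligned_columns)."""
    aligned = matches + mismatches + indels
    query_match = 100.0 * score / perfect_score
    best_local = 100.0 * matches / aligned
    return query_match, best_local, aligned

qm, bls, aligned = summary_percentages(
    score=1784.4, perfect_score=1804, matches=1785, mismatches=1
)
print(f"Query Match {qm:.1f}%  Best Local Similarity {bls:.1f}%")

# The alignment starts at Db position 94; with no gaps, the last aligned
# Db position is 94 + aligned - 1, which should not exceed the Db length.
db_start, db_length = 94, 1879
db_end = db_start + aligned - 1
assert db_end <= db_length
print(aligned, db_end)
```

With these inputs the alignment covers 1786 columns and ends at Db position 1879, the last residue of the database entry, so query positions 1787 to 1804 fall outside the local alignment.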



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Db  514

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

nc 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | , | | | , 

7 54 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813 

721 GGAT CTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTAT AAA 780 

'" i I I I I I I I I I I I I I If I I I I I M I I I I I I I | | | | | | I I I I I | | | M I I I I I I | | | | | 
814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873 

781 GGAGAC AT C T GGT AT AC C C CTAT T AAGGAAGAGT GGTAC T AC C AGAT AGAAAT T CT GAAA 84 0 

MINIMI I I I I I I I | | | | | | | | | | | | | | | , | | | | f | I I I I M I ! I I I I 

87 4 GGAGAC AT C T GGT AT AC C C CTAT T AAGGAAGAGT GGT ACT AC CAG AT AGAAAT T CT GAAA 933 
841 T T G GAAATT GGAGGC C AAAGC CT TAAT CT G GACT G C AGAGAGT AT AAC GC AGACAAGGC C 90 0 

I I I I I I I I I I I I I M I I I I I I I M I I I I M | | | | | | | | | | | | | M | | | | , , , , | , 

934 T T GGAAAT T G GAGGC CAAAGC C T TAAT C T GGACT GC AGAGAGT AT AAC GCAGACAAG GC C 993 

901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960 

1 '! I f I I I I I I I I I I I I I I I I I I I I II I I I I | | | | | | I I I I I I I M I II I I II I I I I I I | 
994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053 

961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020 
1 1 > I I I M I I I I I I I | | | | I I I I I I I I | | | | | | | I M I I I I I I I I I I I I | I I I I I I I I I I 
1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCT GAT GGTTTCT GGACT GGGTCC 1113 

1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 108 0 

' '> I I I I I I I I I I I I | | | | | | | | | | | | | I I I I | | | | | | | | | | | | | | | | I I I I I I I I I I I I 
1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173 

10 81 T AC C T GAGAGAT GAGAACT C C AGC AG GT C ATT CC GT AT C ACAAT C CT G C C T CAG C TT T AC 114 0 

1 1 > I I I I I I II I M I I I | | | | | | | | | | | | | | | | | || | | | | , | , , | ,| | |,|| |, 

117 4 T AC C T GAGAGAC GAGAACT C C AGC AGGT CAT T C C GTAT C ACAAT C C T GC CT C AGCT T T AC 1233 

1141 AT T C AGC C CAT GAT GGGGGCCGGCCT GAAT TAT GAAT GT T AC C GAT T C GG C AT TT C C C CA 12 0 0 

I I I I I I I I I I I I I | | | | | | | | | | M | | | | | | I I I I I I I I I I I I I I | I I | | 

1234 ATT CAGCCCATGATGGGGGCCGGCCTGAATTAT GAAT GTTACCGATTCGGCATTTCCCCA 1293 

1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260 

1 1 > I I M I I I I I I | | I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | I I II I | | | 

1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353 

12 61 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 132 0 

M 1 1 " I I I I I II I I I I I I I I I I I I I I I I I I I | | | | | II I I I I || | | I 

1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413 

1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 138 0 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1414 



1381 



1474 



1441 



1534 



1501 



1594 



1561 



1654 



' I M I I I i I I I I I I I I I I I I I i I I I I I I I I I I | | | | | M I I I I I I I I M | | | | | | 

GTGTCTG7VAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 

CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 
I I M I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | I | 

CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 



1473 



1440 



1533 



1500 



GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 
' I I M I I I I I I I | | M | | | M I I I I I | i | | M I I I I I | | | | M | f | | | | | | 

GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593 
C GT GAC C CT GAG GT C GT CAAT GAT GAGT CCTCTCTGGT C AGAC AT C GC TGGAAAT GAAT A 1560 

I I I I I ' ' I I M I I II I M I I I I I I I I I I I I I I I II I II I I I I I | I I I I I II II I I I I I I I 

C GT GAC C CT GAGGT C GT CAAT GAT GAGT CCTCTCTGGT C AGAC AT C G CT GGAAAT GAAT A 



G C C AG GC C T GAC C T CAAGCAAC CAT GAAC T C AGCT AT TAAGAAAAT C ACAT T T C C AGGGC 

' I N I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I | | | | | II I I I II I I I I I 

GC C AGGC C T GAC CT CAAGCAAC CAT GAACT CAG CT AT TAAGAAAAT C AC AT T T C C AG G G C 



1653 



1620 



1713 



1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 168 0 
1 ' I I I I I I I I I I I I I I I | | | | | | | | | I I M I I I I I | | | | | | | | | | | M I I I I I I I I I I I I 
AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 177 3 



1714 



1681 



1774 



1741 



1834 



GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 17 4 0 

1 ' I I M M I I I I I | | | | | | | I I I I I I I I I I | | | | | | | | | | | | I I I I I I I I I I I I I I I I I I 
GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 18 33 

CT C C CT ACT T C C AAGAAAAAT AATTAAAAAAAAAAC T TC AT T CT AA 178 6 

' I I I I M I I I I I I I I I I I I I I I I I If I I I I I I I I I I I I I I I I I I I I 

CT C C CT AC T T C CAAGAAAAAT AAT TAAAAAAAAAACT T CAT T C TAA 187 9 
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PR 28-APR-1999 

PR 14-MAY-1999 

PR 23-JUN-1999 

PR 26-JUL-1999 

PR 29-OCT-1999 

PR 30-NOV-1999 

PR 02-DEC-1999 

PR 02-DEC-1999 

PR 16-DEC-1999 

PR 30-DEC-1999 

PR 30-DEC-1999 

PR 05-JAN-2000, 

PR 06-JAN-2000, 

PR 06-JAN-2000; 
XX 

PA (GETH) GENENTECH INC.
XX 

PI Ashkenazi AJ, Baker KP, Botstein D, Desnoyers L, Eaton DL; 

PI Ferrara N, Filvaroff E, Fong S, Gao W, Gerber H, Gerritsen ME; 

PI Goddard A, Godowski PJ, Grimaldi CJ, Gurney AL, Hillan KJ;

PI Kljavin IJ, Kuo SS, Napier MA, Pan J, Paoni NF, Roy MA, Shelton DL;

PI Stewart TA, Tumas D, Williams PM, Wood WI;

XX 

DR WPI; 2000-611443/58. 

DR P-PSDB; AAB44270. 
XX 

PT Novel PRO polypeptides and polynucleotides used in detection methods, to 

PT target bioactive molecules to specific cells, and to modulate cellular 

PT activities. 
XX 

PS Claim 2; Fig 72; 636pp; English. 
XX 



CC AAC78458 to AAC78599 represent polynucleotide and EST (expressed sequence
CC tag) sequences which encode secreted or transmembrane PRO polypeptides.
CC The PRO polynucleotides and polypeptides have cytostatic activity. The
CC polynucleotides and polypeptides can be used for detecting the presence
CC of PRO polypeptides in samples, for linking bioactive molecules to cells
CC and for modulating biological activities of cells, using the polypeptides
CC for specific targeting. The polypeptide targeting can be used to kill the
CC target cells, e.g. for the treatment of cancers. The polypeptide pairs
CC provide specific targeting of bioactive molecules to cells. AAC78600 to 
CC AAC78987 represent PCR primers and probes used in the isolation of the 
CC PRO polynucleotide sequences 
XX 

SQ Sequence 1879 BP; 388 A; 559 C; 498 G; 434 T; 0 U; 0 Other; 

Query Match 98.9%; Score 1784.4; DB 3; Length 1879; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879



RESULT 10 
AAS45960 

ID AAS45960 standard; cDNA; 1879 BP. 
XX 

AC AAS45960; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Human DNA encoding PRO polypeptide sequence #36 
XX 

KW PRO polypeptide; mammal; tumour; cancer; human; cattle; horse; sheep; ss; 

KW dog; cat; pig; goat; rabbit; tumour necrosis factor alpha; TNF-alpha; 

KW blood; chondrocyte cell; cell proliferation; cell differentiation; colon; 

KW adrenal; lung; breast; prostate; rectum; cervix; liver; genetic disorder; 

KW PCR primer. 

XX 

OS Homo sapiens.
XX 

PN WO200168848-A2. 
XX 

PD 20-SEP-2001. 
XX 

PF 28-FEB-2001; 2001WO-US006520.
XX 
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PR 04-APR-2000; 2 000US-01 944 4 9P . 
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PR ll-APR-2000; 2 00 0US- 01 9 60 OOP . 

PR 11-APR-2000; 2000US-0196187P.

PR 11-APR-2000; 2000US-0196690P.

PR 11-APR-2000; 2000US-0196820P.

PR 18-APR-2000; 2000US-0198121P.

PR 18-APR-2000; 2000US-0198585P.
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PR 25-APR-2000; 2 000US-0199654 P . 
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PR 30-MAY-2000; 2 000WO-US014 94 1 . 
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2 8-JUL-2 0 00; 2 000WO-US02 0710 . 
PR 22-AUG-2000; 2000US-00644848.
PR 24-AUG-2000; 2000WO-US023328.
PR 08-NOV-2000; 2000WO-US030952.
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(GETH ) GENENTECH INC. 

PI Baker KP, Chen J, Desnoyers L, Goddard A, Godowski PJ, Gurney AL;
PI Pan J, Smith V, Watanabe CK, Wood WI, Zhang Z;

DR WPI; 2001-602746/68.
DR P-PSDB; AAU29059.

PT Novel nucleic acids encoding PRO polypeptides, used to diagnose the
PT presence of tumors, such as prostate and breast tumors, in mammals and to
PT screen for modulators of the compounds.

PS Claim 2; Fig 71; 774pp; English.

CC Sequences AAS45925-AAS46231 represent DNA molecules encoding and PCR
CC primers for PRO polypeptides of the invention. The sequences of the
CC invention can be used to detect the presence of a tumour in a mammal by
CC comparing the level of expression of a PRO polypeptide in a test sample
CC of cells from the animal and a control sample of normal cells, whereby a
CC higher level of expression in the test sample indicates the presence of a
CC tumour in the mammal. Mammals include dogs, cats, cattle, horses, sheep,
CC pigs, goats and rabbits but are preferably human. The polypeptides can be
CC used to stimulate tumour necrosis factor (TNF) alpha release from human
CC blood, when contacted with it. A specific polypeptide can be used to
CC stimulate the proliferation or differentiation of chondrocyte cells. The
CC PRO proteins can be used to determine the presence of tumours and also
CC susceptibility to tumour development, particularly adrenal, lung, colon,
CC breast, prostate, rectal, cervical, or liver tumours, in mammalian
CC subjects. The oligonucleotide probes specific for the PRO nucleic acids
CC can be used for genetic analysis of individuals with genetic disorders

SQ Sequence 1879 BP; 388 A; 559 C; 498 G; 434 T; 0 U; 0 Other;

Query Match 98.9%; Score 1784.4; DB 4; Length 1879;

Best Local Similarity 99.9%; Pred. No. 0;

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879





RESULT 11 
ABX78563 

ID ABX78563 standard; cDNA; 1879 BP. 
XX 

AC ABX78563;
XX 



DT 15-APR-2003 (first entry) 
XX 

DE Human PRO polynucleotide #36.
XX 

KW Human; PRO; gene; ss; cytostatic; tumour; cancer; breast; lung; stomach; 

KW liver; dog; cat; cow; horse; sheep; pig; goat; rabbit; ADEPT; 

KW antibody-dependent enzyme mediated prodrug therapy 
XX 

OS Homo sapiens.
XX 

PN US2003027272-A1.
XX 

PD 06-FEB-2003. 
XX 

PF 21-JUN-2002; 2002US-00176492.
XX 

PR 18-SEP-1997; 97US-0059263P.
PR 18-SEP-1997; 97US-0059266P.
PR 17-OCT-1997; 97US-0062250P.
PR 21-OCT-1997; 97US-0063486P.
PR 24-OCT-1997; 97US-0063120P.
PR 24-OCT-1997; 97US-0063121P.
PR 28-OCT-1997; 97US-0063540P.
PR 28-OCT-1997; 97US-0063541P.
PR 28-OCT-1997; 97US-0063544P.
PR 28-OCT-1997; 97US-0063564P.
PR 29-OCT-1997; 97US-0063734P.
PR 31-OCT-1997; 97US-0063870P.
PR 31-OCT-1997; 97US-0064103P.
PR 13-NOV-1997; 97US-0065311P.
PR 21-NOV-1997; 97US-0066120P.
PR 24-NOV-1997; 97US-0066466P.
PR 24-NOV-1997; 97US-0066772P.
PR 11-DEC-1997; 97US-0069335P.
PR 12-DEC-1997; 97US-0069425P.
PR 17-DEC-1997; 97US-0069870P.
PR 18-DEC-1997; 97US-0068017P.
PR 10-MAR-1998; 98US-0077450P.
PR 11-MAR-1998; 98US-0077632P.
PR 11-MAR-1998; 98US-0077649P.
PR 20-MAR-1998; 98US-0078886P.
PR 20-MAR-1998; 98US-0078939P.
PR 27-MAR-1998; 98US-0079664P.
PR 27-MAR-1998; 98US-0079786P.
PR 31-MAR-1998; 98US-0080107P.
PR 31-MAR-1998; 98US-0080194P.
PR 01-APR-1998; 98US-0080327P.
PR 01-APR-1998; 98US-0080333P.
PR 08-APR-1998; 98US-0081049P.
PR 08-APR-1998; 98US-0081070P.
PR 09-APR-1998; 98US-0081195P.
PR 15-APR-1998; 98US-0081838P.
PR 21-APR-1998; 98US-0082568P.
PR 21-APR-1998; 98US-0082569P.
PR 22-APR-1998; 98US-0082704P.
PR 22-APR-1998; 98US-0082797P.
PR 28-APR-1998; 98US-0083322P.
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PR 
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SEP- 


1998; 


98US- 


0100930P. 
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-1998; 
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-SEP- 


-1998; 


98US- 


-0101068P. 
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-SEP- 
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0102571P. 
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01- 
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1998; 
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0102684P. 


PR 01-OCT-1998; 98US-0102687P.
PR 02-OCT-1998; 98US-0102965P.
PR 06-OCT-1998; 98US-0103258P.
PR 06-OCT-1998; 98US-0103449P.
PR 07-OCT-1998; 98US-00168978.



Query Match 98.9%; Score 1784.4; DB 7; Length 1879; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513



Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933


Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053



Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db       1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353



Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
              ||||||||||||||||||||||||||||||||||||||||||||||
Db       1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879





RESULT 12 
ACA75535 

ID ACA75535 standard; cDNA; 1879 BP. 
XX 

AC ACA75535; 
XX 

DT 07-JUL-2003 (first entry)
XX
DE Novel human secreted and transmembrane protein PRO852 cDNA.
XX
KW Human; secreted and transmembrane protein: PRO; gene therapy;
KW tumour necrosis factor-alpha release; TNF-alpha release;
KW chondrocyte proliferation; chondrocyte differentiation; tumour;
KW adrenal tumour; lung tumour; colon tumour; breast tumour;
KW prostate tumour; rectal tumour; cervical tumour; liver tumour; gene; ss.
XX
OS Homo sapiens.
XX
PN US2003032127-A1.
XX
PD 13-FEB-2003.
XX
PF 26-JUN-2002; 2002US-00183012.
XX

PR 18-SEP-1997; 97US-0059263P.
PR 18-SEP-1997; 97US-0059266P.
PR 17-OCT-1997; 97US-0062250P.
PR 21-OCT-1997; 97US-0063486P.
PR 24-OCT-1997; 97US-0063120P.
PR 24-OCT-1997; 97US-0063121P.
PR 28-OCT-1997; 97US-0063540P.
PR 28-OCT-1997; 97US-0063541P.
PR 28-OCT-1997; 97US-0063544P.
PR 28-OCT-1997; 97US-0063564P.
PR 29-OCT-1997; 97US-0063734P.
PR 31-OCT-1997; 97US-0063870P.
PR 31-OCT-1997; 97US-0064103P.
PR 13-NOV-1997; 97US-0065311P.
PR 21-NOV-1997; 97US-0066120P.
PR 24-NOV-1997; 97US-0066466P.
PR 24-NOV-1997; 97US-0066772P.
PR 11-DEC-1997; 97US-0069335P.
PR 12-DEC-1997; 97US-0069425P.
PR 17-DEC-1997; 97US-0069870P.
PR 18-DEC-1997; 97US-0068017P.
PR 10-MAR-1998; 98US-0077450P.
PR 11-MAR-1998; 98US-0077632P.
PR 11-MAR-1998; 98US-0077649P.
PR 20-MAR-1998; 98US-0078886P.
PR 20-MAR-1998; 98US-0078939P.
PR 27-MAR-1998; 98US-0079664P.
PR 27-MAR-1998; 98US-0079786P.
PR 31-MAR-1998; 98US-0080107P.
PR 31-MAR-1998; 98US-0080194P.
PR 01-APR-1998; 98US-0080327P.
PR 01-APR-1998; 98US-0080333P.
PR 08-APR-1998; 98US-0081049P.
PR 08-APR-1998; 98US-0081070P.
PR 09-APR-1998; 98US-0081195P.
PR 15-APR-1998; 98US-0081838P.
PR 21-APR-1998; 98US-0082568P.
PR 21-APR-1998; 98US-0082569P.
PR 22-APR-1998; 98US-0082704P.
PR 22-APR-1998; 98US-0082797P.
PR 28-APR-1998; 98US-0083322P.
PR 29-APR-1998; 98US-0083495P.
PR 29-APR-1998; 98US-0083496P.
PR 29-APR-1998; 98US-0083499P.
PR 29-APR-1998; 98US-0083559P.
PR 05-MAY-1998; 98US-0084366P.
PR 06-MAY-1998; 98US-0084414P.
PR 07-MAY-1998; 98US-0084639P.
PR 07-MAY-1998; 98US-0084640P.
PR 07-MAY-1998; 98US-0084643P.
PR 15-MAY-1998; 98US-0085579P.
PR 15-MAY-1998; 98US-0085580P.
PR 15-MAY-1998; 98US-0085582P.
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-0086023P. 
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PR 


22 


-MAY 


-1998 


98US 


-0086486P. 
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PR 


18- 


-AUG- 


-1998 


98US- 


-0096959P. 


PR 18-AUG-1998; 98US-0097022P.
PR 26-AUG-1998; 98US-0097952P.


PR 


26- 


-AUG- 


-1998 


; 98US- 
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PR 
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-0097955P. 


PR 
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-1998, 


' 98US- 


-0097974P. 


PR 26-AUG-1998; 98US-0098014P.
PR 01-SEP-1998; 98US-0098716P.
PR 01-SEP-1998; 98US-0098723P.
PR 02-SEP-1998; 98US-0098803P.
PR 02-SEP-1998; 98US-0098821P.
PR 02-SEP-1998; 98US-0098843P.
PR 09-SEP-1998; 98US-0099602P.
PR 10-SEP-1998; 98US-0099741P.
PR 10-SEP-1998; 98US-0099754P.
PR 10-SEP-1998; 98US-0099763P.
PR 10-SEP-1998; 98US-0099812P.


PR 
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■SEP- 


1998; 


98US- 


•0100388P. 


PR 
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SEP- 
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98US- 


0100662P. 


PR 
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SEP- 


1998; 


98US- 


0100664P. 


PR 16-SEP-1998; 98US-0101751P.


PR 
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SEP- 
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98WO- 


US019330. 


PR 17-SEP-1998; 98US-0100683P.
PR 17-SEP-1998; 98US-0100684P.


PR 
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SEP- 
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98US- 


0100919P. 
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SEP- 


1998; 


98US- 
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PR 
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1998 ; 


98US- 
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PR 
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0101014P. 


PR 
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1998 ; 
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PR 
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98US- 


0101471P. 


PR 
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SEP- 


1998 ; 
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PR 
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SEP- 
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98US- 


0101475P. 


PR 
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SEP- 


1998; 


98US- 
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PR 


24- 


SEP- 


1998; 
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0101738P. 


PR 
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SEP- 


1998; 
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PR 


24- 


SEP- 


1998; 
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0101743P. 


PR 


24- 


SEP- 


1998; 
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0101922P. 


PR 


25- 


SEP- 
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PR 
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0102207P. 


PR 


29 
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29 
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PR 


29 


-SEP- 
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0102331P. 


PR 30-SEP-1998; 98US-0102487P.
PR 30-SEP-1998; 98US-0102570P.
PR 30-SEP-1998; 98US-0102571P.
PR 01-OCT-1998; 98US-0102684P.
PR 01-OCT-1998; 98US-0102687P.
PR 02-OCT-1998; 98US-0102965P.
PR 06-OCT-1998; 98US-0103258P.



Query Match 98.9%; Score 1784.4; DB 7; Length 1879; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633


Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693
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TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 
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694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753 

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
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RESULT 13 
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XX 
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XX 

DT 02-AUG-2003 (first entry) 
XX 

DE Human secreted/transmembrane protein (PRO) cDNA #36.
XX 

KW Human; gene; ss; secreted and transmembrane protein; PRO; TNF-alpha; 

KW tumour necrosis factor alpha; chondrocyte cell; tumour; gene therapy; 

KW tissue typing. 
XX 

OS Homo sapiens. 
XX 

PN US2003032112-A1.
XX 
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XX 

PF 21-JUN-2002; 2002US-00176756.
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Query Match 98.9%; Score 1784.4; 

Best Local Similarity 99.9%; Pred. No. 0; 



DB 7; Length 1879; 



Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933



Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db       1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653


Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
              ||||||||||||||||||||||||||||||||||||||||||||||
Db       1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879



RESULT 14 




ACC87543 




ID ACC87543 standard; cDNA; 1879 BP.
XX
AC ACC87543;
XX
DT 05-AUG-2003 (first entry)
XX
DE Human secreted polypeptide PRO852-encoding cDNA, SEQ ID NO: 71.
XX
KW Human; PRO; secreted protein; transmembrane protein;
KW extracellular domain; tumour necrosis factor-alpha; TNF-alpha;
KW chondrocyte; proliferation; differentiation; cartilage disorder;
KW bone disorder; arthritis; sports injury; cancer; tumour; diagnosis;
KW adrenal tumour; lung; colon; breast; prostate; kidney; rectum; cervix;
KW liver; drug screening; transgenic animal; genetic analysis;
KW antiarthritic; vulnerary; gene therapy; gene; ss.
XX
OS Homo sapiens.
XX
PN US2003027278-A1.
XX
PD 06-FEB-2003.
XX
PF 21-JUN-2002; 2002US-00176987.
XX






PR 18-SEP-1997; 97US-0059263P.
PR 18-SEP-1997; 97US-0059266P.
PR 17-OCT-1997; 97US-0062250P.
PR 21-OCT-1997; 97US-0063486P.
PR 24-OCT-1997; 97US-0063120P.
PR 24-OCT-1997; 97US-0063121P.
PR 28-OCT-1997; 97US-0063540P.
PR 28-OCT-1997; 97US-0063541P.
PR 28-OCT-1997; 97US-0063544P.
PR 28-OCT-1997; 97US-0063564P.
PR 29-OCT-1997; 97US-0063734P.
PR 31-OCT-1997; 97US-0063870P.
PR 31-OCT-1997; 97US-0064103P.
PR 13-NOV-1997; 97US-0065311P.
PR 21-NOV-1997; 97US-0066120P.
PR 24-NOV-1997; 97US-0066466P.
PR 24-NOV-1997; 97US-0066772P.
PR 11-DEC-1997; 97US-0069335P.
PR 12-DEC-1997; 97US-0069425P.
PR 17-DEC-1997; 97US-0069870P.
PR 18-DEC-1997; 97US-0068017P.
PR 10-MAR-1998; 98US-0077450P.



PR 11-MAR-1998; 98US-0077632P.
PR 11-MAR-1998; 98US-0077649P.
PR 20-MAR-1998; 98US-0078886P.
PR 20-MAR-1998; 98US-0078939P.
PR 27-MAR-1998; 98US-0079664P.
PR 27-MAR-1998; 98US-0079786P.
PR 31-MAR-1998; 98US-0080107P.
PR 31-MAR-1998; 98US-0080194P.
PR 01-APR-1998; 98US-0080327P.
PR 01-APR-1998; 98US-0080333P.
PR 08-APR-1998; 98US-0081049P.
PR 08-APR-1998; 98US-0081070P.
PR 09-APR-1998; 98US-0081195P.
PR 15-APR-1998; 98US-0081838P.
PR 21-APR-1998; 98US-0082568P.
PR 21-APR-1998; 98US-0082569P.
PR 22-APR-1998; 98US-0082704P.
PR 22-APR-1998; 98US-0082797P.
PR 28-APR-1998; 98US-0083322P.
PR 29-APR-1998; 98US-0083495P.
PR 29-APR-1998; 98US-0083496P.
PR 29-APR-1998; 98US-0083499P.
PR 29-APR-1998; 98US-0083559P.
PR 05-MAY-1998; 98US-0084366P.
PR 06-MAY-1998; 98US-0084414P.
PR 07-MAY-1998; 98US-0084639P.
PR 07-MAY-1998; 98US-0084640P.
PR 07-MAY-1998; 98US-0084643P.
PR 15-MAY-1998; 98US-0085579P.
PR 15-MAY-1998; 98US-0085580P.
PR 15-MAY-1998; 98US-0085582P.
PR 15-MAY-1998; 98US-0085700P.
PR 18-MAY-1998; 98US-0086023P.
PR 22-MAY-1998; 98US-0086392P.
PR 22-MAY-1998; 98US-0086486P.
PR 28-MAY-1998; 98US-0087098P.
PR 28-MAY-1998; 98US-0087208P.
PR 02-JUN-1998; 98US-0087609P.
PR 02-JUN-1998; 98US-0087759P.
PR 03-JUN-1998; 98US-0087827P.
PR 04-JUN-1998; 98US-0088025P.
PR 04-JUN-1998; 98US-0088028P.
PR 04-JUN-1998; 98US-0088029P.
PR 04-JUN-1998; 98US-0088033P.
PR 04-JUN-1998; 98US-0088326P.
PR 05-JUN-1998; 98US-0088167P.
PR 05-JUN-1998; 98US-0088202P.
PR 05-JUN-1998; 98US-0088212P.
PR 05-JUN-1998; 98US-0088217P.
PR 09-JUN-1998; 98US-0088655P.


PR 


10- 


JUN- 


1998; 


98US- 


0088722P. 


PR 


10- 


JUN- 


1998; 


98US- 


0088738P. 


PR 


10- 


JUN- 


1998; 


98US- 


0088740P. 


PR 


10- 


JUN- 


1998; 


98US- 


0088811P. 


PR 


10- 


JUN- 


1998; 


98US- 


0088824P. 


PR 


10- 


JUN- 


1998; 


98US- 


0088825P. 


PR 


10- 


JUN- 


1998; 


98US- 


0088826P. 



PR 


11 


-JUN- 


-1998 


98US- 


-0088861P. 


PR 


11 


-JUN- 


-1998 


98US- 


-0088863P. 


PR 


11 


-JUN- 


-1998 


; 98US- 


-0088876P. 


PR 


12 


-JUN- 


-1998 


98US- 


-0089090P. 


PR 


12 


- JUN- 


-1998 


; 98US- 


-0089105P. 


PR 


16 


-JUN- 


-1998 


98US- 


-0089512P. 


PR 


16 


-JUN- 


-1998 


98US- 


-0089514P. 


PR 


17 


-JUN- 


-1998 


; 98US- 


-0089538P. 


PR 


17 


- JUN- 


-1998 


98US- 


-0089598P. 


PR 


17 


-JUN- 


-1998 


98US- 


-0089653P. 


PR 


18 


-JUN- 


-1998 


98US- 


-0089908P. 


PR 


19- 


-JUN- 


-1998 


98US- 


-0089952P. 


PR 


22 


- JUN- 


-1998 


; 98US- 


-0090246P. 


PR 


22- 


-JUN- 


-1998 


98US- 


-0090252P. 


PR 


22- 


-JUN- 


-1998 


98US- 


-0090254P. 


PR 


24- 


- JUN- 


-1998 


; 98US- 


-0090429P. 


PR 


24- 


-JUN- 


-1998 


98US- 


-0090435P. 


PR 


24- 


-JUN- 


-1998 


98US- 


-0090444P. 


PR 


24- 


-JUN- 


-1998 


98US- 


-0090461P. 


PR 


24- 


-JUN- 


-1998 


98US- 


-0090535P. 


PR 


24- 


- JUN- 


-1998, 


; 98US- 


-0090540P. 


PR 


25- 


-JUN- 


-1998, 


; 98US- 


-0090676P. 


PR 


25- 


- JUN- 


-1998 ( 


; 98US- 


-0090678P. 


PR 


25- 


-JUN- 


-1998, 


r 98US- 


-0090688P. 


PR 


25- 


-JUN- 


-1998, 


; 98US- 


-0090690P. 


PR 


25- 


-JUN- 


-1998, 


; 98US- 


-0090694P. 


PR 


25- 


-JUN- 


1998, 


; 98US- 


-0090695P. 


PR 


25- 


- JUN- 


■1998, 


; 98US- 


0090696P. 


PR 


26- 


-JUN- 


•1998, 


i 98US- 


00105413 . 


PR 


26- 


-JUN- 


1998, 


98US- 


•0090862P. 


PR 


2 6- 


-JUN- 


1998 ; 


98US- 


0090863P. 


PR 


2 6- 


-JUN- 


1998; 


98US- 


■0091010P. 


PR 


01- 


-JUL- 


1998; 


98US- 


0091359P. 


PR 


01- 


- JUL- 


1998; 


98US- 


0091544P. 


PR 


02- 


-JUL- 


1998; 


98US- 


0091478P. 


PR 


02- 


-JUL- 


1998; 


98US- 


0091486P. 


PR 


02- 


-JUL- 


1998; 


98US- 


0091626P. 


PR 


02- 


-JUL- 


1998; 


98US- 


0091628P. 


PR 


02- 


-JUL- 


1998; 


98US- 


0091632P. 


PR 


24- 


-JUL- 


1998; 


98US- 


0094006P. 


PR 


04- 


-AUG- 


1998; 


98US- 


0095282P. 


PR 


10- 


-AUG- 


1998; 


98US- 


0095998P. 


PR 


10- 


-AUG- 


1998; 


98US- 


0096012P. 


PR 


17- 


-AUG- 


1998; 


98US- 


0096757P. 


PR 


17- 


-AUG- 


1998; 


98US- 


0096766P. 


PR 


17- 


-AUG- 


1998; 


98US- 


0096867P. 


PR 


17- 


-AUG- 


1998; 


98US- 


0096891P. 


PR 


17- 


-AUG- 


1998; 


98US- 


0096897P. 


PR 


18- 


-AUG- 


1998; 


98US- 


0096949P. 


PR 


18- 


-AUG- 


1998; 


98US- 


0096959P. 


PR 


18- 


-AUG- 


1998; 


98US- 


0097022P. 


PR 


26- 


-AUG- 


1998; 


98US- 


0097952P. 


PR 


26- 


-AUG- 


1998; 


98US- 


0097954P. 


PR 


26- 


-AUG- 


1998; 


98US- 


0097955P. 


PR 


26- 


-AUG- 


1998; 


98US- 


0097971P. 


PR 


26- 


-AUG- 


1998; 


98US- 


0097974P. 


PR 


26- 


-AUG- 


1998; 


98US- 


0098014P. 



PR 


01- 


-SEP- 


-1998; 98US- 


-0098716P. 


PR 


01- 


-SEP- 


-1998 


98US- 


-0098723P. 


PR 


02- 


-SEP- 


-1998 


98US- 


-0098803P. 


PR 


02- 


-SEP- 


-1998 


98US- 


-0098821P. 


PR 


02- 


-SEP- 


-1998 


98US- 


-0098843P. 


PR 


09- 


-SEP- 


-1998 


98US- 


-0099602P. 


PR 


10- 


-SEP- 


-1998 


98US- 


-0099741P. 


PR 


10- 


-SEP- 


-1998 


; 98US- 


-0099754P. 


PR 


10- 


-SEP- 


-1998 


98US- 


-0099763P. 


PR 


10- 


-SEP- 


-1998 


98US- 


-0099812P. 


PR 


15- 


-SEP- 


-1998 


; 98US- 


-0100388P. 


PR 


16- 


-SEP- 


-1998 


98US- 


-0100662P. 


PR 


16- 


-SEP- 


-1998 


98US- 


-0100664P. 


PR 


16- 


-SEP- 


-1998 


98US- 


-0101751P. 


PR 


16- 


-SEP- 


-1998, 


; 98WO- 


-US019330. 


PR 


17- 


-SEP- 


■1998, 


; 98US- 


■0100683P. 


PR 


17- 


-SEP- 


-1998, 


; 98US- 


-0100684P. 


PR 


17- 


-SEP- 


•1998, 


r 98US- 


0100919P. 


PR 


17- 


-SEP- 


■1998, 


? 98US- 


•0100930P. 


PR 


18- 


■SEP- 


■1998, 


r 98US- 


0100849P. 


PR 


18- 


-SEP- 


1998, 


' 98US- 


0101014P. 


PR 


18- 


-SEP- 


1998, 


98US- 


0101068P. 


PR 


23- 


■SEP- 


1998; 


98US- 


0101471P. 


PR 


23- 


SEP- 


1998; 


98US- 


0101472P. 


PR 


23- 


•SEP- 


1998; 


98US- 


0101475P. 


PR 


23- 


■SEP- 


1998; 


98US- 


0101477P. 


PR 


24- 


SEP- 


1998; 


98US- 


0101738P. 


PR 


24- 


SEP- 


1998; 


98US- 


0101739P. 


PR 


24- 


SEP- 


1998; 


98US- 


0101743P. 


PR 


24- 


SEP- 


1998; 


98US- 


0101922P. 


PR 


25- 


SEP- 


1998; 


98US- 


0101786P. 


PR 


29- 


SEP- 


1998; 


98US- 


0102207P. 


PR 


29- 


SEP- 


1998; 


98US- 


0102240P. 


PR 


29- 


SEP- 


1998; 


98US- 


0102330P. 


PR 


29- 


SEP- 


1998; 


98US- 


0102331P. 


PR 


30- 


SEP- 


1998; 


98US- 


0102487P. 


PR 


30- 


SEP- 


1998; 


98US- 


0102570P. 


PR 


30- 


SEP- 


1998; 


98US- 


0102571P. 


PR 


01- 


OCT- 


1998; 


98US- 


0102684P. 


PR 


01- 


OCT- 


1998; 


98US- 


0102687P. 



Query Match 98.9%; Score 1784.4; DB 7; Length 1879;
Best Local Similarity 99.9%; Pred. No. 0;
Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db       1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
              ||||||||||||||||||||||||||||||||||||||||||||||
Db       1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879





RESULT 15 
ACC86929 

ID   ACC86929 standard; cDNA; 1879 BP.
XX
AC   ACC86929;
XX
DT   05-AUG-2003  (first entry)
XX
DE   Human secreted polypeptide PRO852-encoding cDNA, SEQ ID NO: 71.
XX
KW   Human; PRO; secreted protein; transmembrane protein;
KW   extracellular domain; tumour necrosis factor-alpha; TNF-alpha;
KW   chondrocyte; proliferation; differentiation; cartilage disorder;
KW   bone disorder; arthritis; sports injury; cancer; tumour; diagnosis;
KW   adrenal tumour; lung; colon; breast; prostate; kidney; rectum; cervix;
KW   liver; drug screening; transgenic animal; genetic analysis;
KW   antiarthritic; vulnerary; gene therapy; gene; ss.
XX
OS   Homo sapiens.
XX
PN   US2003036159-A1.
XX
PD   20-FEB-2003.
XX
PF   02-JUL-2002; 2002US-00188773.
XX
PR   18-SEP-1997;   97US-0059263P.
PR   18-SEP-1997;   97US-0059266P.
PR   17-OCT-1997;   97US-0062250P.
PR   21-OCT-1997;   97US-0063486P.
PR   24-OCT-1997;   97US-0063120P.
PR   24-OCT-1997;   97US-0063121P.
PR   28-OCT-1997;   97US-0063540P.
PR   28-OCT-1997;   97US-0063541P.
PR   28-OCT-1997;   97US-0063544P.
PR   28-OCT-1997;   97US-0063564P.
PR   29-OCT-1997;   97US-0063734P.
PR   31-OCT-1997;   97US-0063870P.
PR   31-OCT-1997;   97US-0064103P.
PR   13-NOV-1997;   97US-0065311P.
PR   21-NOV-1997;   97US-0066120P.
PR   24-NOV-1997;   97US-0066466P.
PR   24-NOV-1997;   97US-0066772P.
PR   11-DEC-1997;   97US-0069335P.
PR   12-DEC-1997;   97US-0069425P.
PR   17-DEC-1997;   97US-0069870P.
PR   18-DEC-1997;   97US-0068017P.
PR   10-MAR-1998;   98US-0077450P.
PR   11-MAR-1998;   98US-0077632P.
PR   11-MAR-1998;   98US-0077649P.
PR   20-MAR-1998;   98US-0078886P.
PR   20-MAR-1998;   98US-0078939P.
PR   27-MAR-1998;   98US-0079664P.
PR   27-MAR-1998;   98US-0079786P.
PR   31-MAR-1998;   98US-0080107P.
PR   31-MAR-1998;   98US-0080194P.
PR   01-APR-1998;   98US-0080327P.
PR   01-APR-1998;   98US-0080333P.
PR   08-APR-1998;   98US-0081049P.
PR   08-APR-1998;   98US-0081070P.
PR   09-APR-1998;   98US-0081195P.
PR   15-APR-1998;   98US-0081838P.
PR   21-APR-1998;   98US-0082568P.
PR   21-APR-1998;   98US-0082569P.
PR   22-APR-1998;   98US-0082704P.
PR   22-APR-1998;   98US-0082797P.
PR   28-APR-1998;   98US-0083322P.
PR   29-APR-1998;   98US-0083495P.
PR   29-APR-1998;   98US-0083496P.
PR   29-APR-1998;   98US-0083499P.
PR   29-APR-1998;   98US-0083559P.
PR   05-MAY-1998;   98US-0084366P.
PR   06-MAY-1998;   98US-0084414P.
PR   07-MAY-1998;   98US-0084639P.
PR   07-MAY-1998;   98US-0084640P.
PR   07-MAY-1998;   98US-0084643P.
PR   15-MAY-1998;   98US-0085579P.
PR   15-MAY-1998;   98US-0085580P.
PR   15-MAY-1998;   98US-0085582P.
PR   15-MAY-1998;   98US-0085700P.
PR   18-MAY-1998;   98US-0086023P.
PR   22-MAY-1998;   98US-0086392P.
PR   22-MAY-1998;   98US-0086486P.
PR   28-MAY-1998;   98US-0087098P.
PR   28-MAY-1998;   98US-0087208P.
PR   02-JUN-1998;   98US-0087609P.
PR   02-JUN-1998;   98US-0087759P.
PR   03-JUN-1998;   98US-0087827P.
PR   04-JUN-1998;   98US-0088025P.
PR   04-JUN-1998;   98US-0088028P.
PR   04-JUN-1998;   98US-0088029P.
PR   04-JUN-1998;   98US-0088033P.
PR   04-JUN-1998;   98US-0088326P.
PR   05-JUN-1998;   98US-0088167P.
PR   05-JUN-1998;   98US-0088202P.
PR   05-JUN-1998;   98US-0088212P.
PR   05-JUN-1998;   98US-0088217P.
PR   09-JUN-1998;   98US-0088655P.
PR   10-JUN-1998;   98US-0088722P.
PR   10-JUN-1998;   98US-0088738P.
PR   10-JUN-1998;   98US-0088740P.
PR   10-JUN-1998;   98US-0088811P.
PR   10-JUN-1998;   98US-0088824P.
PR   10-JUN-1998;   98US-0088825P.
PR   10-JUN-1998;   98US-0088826P.
PR   11-JUN-1998;   98US-0088861P.
PR   11-JUN-1998;   98US-0088863P.
PR   11-JUN-1998;   98US-0088876P.
PR   12-JUN-1998;   98US-0089090P.
PR   12-JUN-1998;   98US-0089105P.
PR   16-JUN-1998;   98US-0089512P.
PR   16-JUN-1998;   98US-0089514P.
PR   17-JUN-1998;   98US-0089538P.
PR   17-JUN-1998;   98US-0089598P.
PR   17-JUN-1998;   98US-0089653P.
PR   18-JUN-1998;   98US-0089908P.
PR   19-JUN-1998;   98US-0089952P.
PR   22-JUN-1998;   98US-0090246P.
PR   22-JUN-1998;   98US-0090252P.
PR   22-JUN-1998;   98US-0090254P.
PR   24-JUN-1998;   98US-0090429P.
PR   24-JUN-1998;   98US-0090435P.
PR   24-JUN-1998;   98US-0090444P.
PR   24-JUN-1998;   98US-0090461P.
PR   24-JUN-1998;   98US-0090535P.
PR   24-JUN-1998;   98US-0090540P.
PR   25-JUN-1998;   98US-0090676P.
PR   25-JUN-1998;   98US-0090678P.
PR   25-JUN-1998;   98US-0090688P.
PR   25-JUN-1998;   98US-0090690P.
PR   25-JUN-1998;   98US-0090694P.
PR   25-JUN-1998;   98US-0090695P.
PR   25-JUN-1998;   98US-0090696P.
PR   26-JUN-1998;   98US-00105413.
PR   26-JUN-1998;   98US-0090862P.
PR   26-JUN-1998;   98US-0090863P.
PR   26-JUN-1998;   98US-0091010P.
PR   01-JUL-1998;   98US-0091359P.
PR   01-JUL-1998;   98US-0091544P.
PR   02-JUL-1998;   98US-0091478P.
PR   02-JUL-1998;   98US-0091486P.
PR   02-JUL-1998;   98US-0091626P.
PR   02-JUL-1998;   98US-0091628P.
PR   02-JUL-1998;   98US-0091632P.
PR   24-JUL-1998;   98US-0094006P.
PR   04-AUG-1998;   98US-0095282P.
PR   10-AUG-1998;   98US-0095998P.
PR   10-AUG-1998;   98US-0096012P.
PR   17-AUG-1998;   98US-0096757P.
PR   17-AUG-1998;   98US-0096766P.
PR   17-AUG-1998;   98US-0096867P.
PR   17-AUG-1998;   98US-0096891P.
PR   17-AUG-1998;   98US-0096897P.
PR   18-AUG-1998;   98US-0096949P.
PR   18-AUG-1998;   98US-0096959P.
PR   18-AUG-1998;   98US-0097022P.
PR   26-AUG-1998;   98US-0097952P.
PR   26-AUG-1998;   98US-0097954P.
PR   26-AUG-1998;   98US-0097955P.
PR   26-AUG-1998;   98US-0097971P.
PR   26-AUG-1998;   98US-0097974P.
PR   26-AUG-1998;   98US-0098014P.
PR   01-SEP-1998;   98US-0098716P.
PR   01-SEP-1998;   98US-0098723P.
PR   02-SEP-1998;   98US-0098803P.
PR   02-SEP-1998;   98US-0098821P.
PR   02-SEP-1998;   98US-0098843P.
PR   09-SEP-1998;   98US-0099602P.
PR   10-SEP-1998;   98US-0099741P.
PR   10-SEP-1998;   98US-0099754P.
PR   10-SEP-1998;   98US-0099763P.
PR   10-SEP-1998;   98US-0099812P.
PR   15-SEP-1998;   98US-0100388P.
PR   16-SEP-1998;   98US-0100662P.
PR   16-SEP-1998;   98US-0100664P.
PR   16-SEP-1998;   98US-0101751P.
PR   16-SEP-1998;   98WO-US019330.
PR   17-SEP-1998;   98US-0100683P.
PR   17-SEP-1998;   98US-0100684P.
PR   17-SEP-1998;   98US-0100919P.
PR   17-SEP-1998;   98US-0100930P.
PR   18-SEP-1998;   98US-0100849P.
PR   18-SEP-1998;   98US-0101014P.
PR   18-SEP-1998;   98US-0101068P.
PR   23-SEP-1998;   98US-0101471P.
PR   23-SEP-1998;   98US-0101472P.
PR   23-SEP-1998;   98US-0101475P.
PR   23-SEP-1998;   98US-0101477P.
PR   24-SEP-1998;   98US-0101738P.
PR   24-SEP-1998;   98US-0101739P.
PR   24-SEP-1998;   98US-0101743P.
PR   24-SEP-1998;   98US-0101922P.
PR   25-SEP-1998;   98US-0101786P.
PR   29-SEP-1998;   98US-0102207P.
PR   29-SEP-1998;   98US-0102240P.
PR   29-SEP-1998;   98US-0102330P.
PR   29-SEP-1998;   98US-0102331P.
PR   30-SEP-1998;   98US-0102487P.
PR   30-SEP-1998;   98US-0102570P.
PR   30-SEP-1998;   98US-0102571P.
PR   01-OCT-1998;   98US-0102684P.
PR   01-OCT-1998;   98US-0102687P.

Query Match 98.9%; Score 1784.4; DB 7; Length 1879;
Best Local Similarity 99.9%; Pred. No. 0;
Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

I M I I I I I I I I I I I I I I I I E I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 934 T T G GAAAT T GGAGG C CAAAGC C T TAAT C T GGAC T GCAGAGAGT AT AAC GC AGACAAG GC C 993 

Qy 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M 
Db 994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053 

Qy 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I E I I I I I I I I I I I I I I I I E I I I I I I I I 
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113 

Qy 1021 CAGCTGGCGT GCT GGAC GAAT TCGGAAACACCTT GGT CTTACTTCCCTAAAATCTCC AT C 10 8 0 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II IE I f I I I I I I I I I I I I I II M I I I I I I 
Db 1114 CAGCTGGCGT GCT GGACGAATTCGGAAACACCTT GGT CTTACTTCCCTAAAATCTCCATC 1173 

Qy 1081 T AC CT GAGAGAT GAGAACT C CAG CAG GT CAT T C C GT AT C ACAAT C CT G C C T C AG CT T T AC 114 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I IE I I I I II I I I I I I I I II I I I I I I I I I I I 

Db 117 4 T AC C T GAGAGAC GAGAACT C C AGC AGGT CAT T C C GT AT C ACAAT C CT GC CT CAGCT T T AC 12 33 

Qy 1141 ATT CAG C C CAT GAT GGGGGCCGGCCT GAAT TAT GAAT GT T AC C GAT T C GG CAT TT C C C CA 1200 

II I I I I I I I I I I I I I I I I I I I I I I I I II I E I I I I I I I I I I I II I I I I | | | | | | | | | I I I I 

Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293 

Qy 12 01 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGT CATCTTCGAC 12 60 

I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I t E I I I I I I I I I I I I I I I M I 
Db 12 94 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353 



Qy 12 61 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 132 0 

I I I I I I M I I I I I I I I f I I I I I I I I If I I I I I t I I I I i I I I I I I I I | | | | I | | t I I I I I I 
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413 

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 13 8 0 

I I I I M I I I I I II I I I I I I I I I t I I I I I I I I I I I I I I I I I | | M | | | | | M I M I i I I I I 
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 14 7 3 

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440 

I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I II I I I I | | | | | | | | | | | | | | | M | | | | 
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533 

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500 

I I I I I I I I M I I I I I I I I I I I I M I I I I I II I I M I I I I I | | | | | | | | | | | | | | | | | | | | 
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593 

Qy 1501 C GT GAC C C T GAG GT C GT CAAT GAT GAGT C C T CT CT GGT CAGACAT C GC T G GAAAT GAAT A 1560 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | M I 
Db 1594 C GT GAC C CT GAGGT C GT CAAT GAT GAGT CCT CT CT G GT CAGACAT C G CT GGAAAT GAAT A 1653 

Qy 1561 G C CAGGC CT GAC CT CAAGCAAC CAT GAACT C AGCT AT T AAGAAAAT C AC AT T T CCAG G G C 162 0 

M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I | I | | | | | | | || M | | || | | | 
Db 1654 G C C AGG C C T GAC C T CAAG CAAC CAT GAACT C AGCT AT TAAGAAAAT C AC AT T T CCAG GG C 1713 

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 168 0 

I I I I I I I I M I I I II I I I I I I I I I I I I I I | | | | | | | | | | | | | | I | | | | | | | | | | | | | | | | 
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 17 73 

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 174 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I f I I I I I I I I I I 
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833 

Qy 1741 C T C C C T AC T T C C AAG AAAAAT AAT T AAAAAAAAAAC T T CAT T C T AA 1786 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18 34 CT C CCTACTT CCAAGAAAAAT AATT AAAAAAAAAACTT CATT CTAA 187 9 

Search completed: March 5, 2004, 21:23:48 
Job time: 710 secs
Run on:          March

; Search time 154 Seconds
(without alignments)
6500.853 Million cell updates/sec

Title:           US-09-668-314C-1
Perfect score:   1804
Sequence:        1 atgggcgcactggcccgggc...aaaccaaaaaaaaaaaaaaa 1804

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        682709 seqs, 277475446 residues

Total number of hits satisfying chosen parameters:   1365418

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database:        Issued_Patents_NA:*
                 1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
                 2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
                 3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
                 4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
                 5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
                 6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 

US-09-548-372D-1 

; Sequence 1, Application US/09548372D
; Patent No. 6420534
; GENERAL INFORMATION:
; APPLICANT: GURNEY ET AL.
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR AND USES
; TITLE OF INVENTION: THEREOF
; FILE REFERENCE: 29915/62801
; CURRENT APPLICATION NUMBER: US/09/548,372D
; CURRENT FILING DATE: 2000-04-12
; PRIOR APPLICATION NUMBER: US 60/155,493
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: US 09/404,133
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: PCT/US99/20881
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: US 60/101,594
; PRIOR FILING DATE: 1998-09-24
; NUMBER OF SEQ ID NOS: 73
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 1
; LENGTH: 1804
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-548-372D-1

Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660



Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500



Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804



RESULT 2 

US-09-548-367D-1 

; Sequence 1, Application US/09548367D
; Patent No. 6440698
; GENERAL INFORMATION:
; APPLICANT: GURNEY ET AL.
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR AND USES
; TITLE OF INVENTION: THEREOF
; FILE REFERENCE: 29915/6280H
; CURRENT APPLICATION NUMBER: US/09/548,367D
; CURRENT FILING DATE: 2000-04-12
; PRIOR APPLICATION NUMBER: US 60/155,493
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: US 09/404,133
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: PCT/US99/20881
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: US 60/101,594
; PRIOR FILING DATE: 1998-09-24
; NUMBER OF SEQ ID NOS: 73
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 1
; LENGTH: 1804
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-548-367D-1

Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804



RESULT 3 

US-09-551-853D-1 

; Sequence 1, Application US/09551853D
; Patent No. 6500667
; GENERAL INFORMATION:
; APPLICANT: GURNEY ET AL.
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR AND USES
; TITLE OF INVENTION: THEREOF
; FILE REFERENCE: 29915/6280L
; CURRENT APPLICATION NUMBER: US/09/551,853D
; CURRENT FILING DATE: 2000-04-18
; PRIOR APPLICATION NUMBER: US 60/155,493
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: US 09/404,133
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: PCT/US99/20881
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: US 60/101,594
; PRIOR FILING DATE: 1998-09-24
; NUMBER OF SEQ ID NOS: 73
; SOFTWARE: PatentIn version 3.1
; SEQ ID NO 1
; LENGTH: 1804
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-551-853D-1



Query Match 100.0%; Score 1804; DB 4; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900


Qy 


901 


ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 

1 1 1 1 1 N i 1 1 1 M 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | 1 1 1 1 1 1 1 1 1 | | | | | | | 1 1 1 1 1 1 1 1 1 
ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 


960 


Db 


901 


960 


Qy 


961 


GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 
i 1 1 1 1 1 1 1 M 1 i 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 i 1 1 I I I || I | | | | | | I | | 
GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 


1020 


Db 


961 


1020 


Qy 


1021 


CAGCTGGCGT GCT GGAC GAATTCGGAAACACCTT GGT CTTACTTCCCTAAAATCTC CATC 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I I I I I I | | | | | | | | | | | | | | | M | | | f | | | | | | | | 

CAGCTGGCGT GCT GGACGAATTCGGAAACACCTT GGT CTTACTTCCCTAAAATCTCCATC 


1080 


Db 


1021 


1080 



Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy   1801 AAAA 1804
          ||||
Db   1801 AAAA 1804

RESULT 4
US-09-215-450-18
; Sequence 18, Application US/09215450
; Patent No. 6635748
; GENERAL INFORMATION:
; APPLICANT: Giese, Klaus
; APPLICANT: Xin, Hong
; TITLE OF INVENTION: METASTATIC BREAST AND COLON CANCER REGULATED GENES
; FILE REFERENCE: 1451.100 / 210030.447
; CURRENT APPLICATION NUMBER: US/09/215,450
; CURRENT FILING DATE: 1998-12-17
; NUMBER OF SEQ ID NOS: 27
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 18
; LENGTH: 1873
; TYPE: DNA
; ORGANISM: human
US-09-215-450-18



Query Match 98.0%; Score 1768.2; DB 4; Length 1873; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1770; Conservative 0; Mismatches 3; Indels 0; Gaps 0;



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    101 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 160

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    161 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 220

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    221 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 280

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    281 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 340

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    341 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 400

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    401 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 460

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    461 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 520

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    521 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 580

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    581 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 640

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    641 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 700

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    701 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 760

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    761 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 820

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    821 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 880

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    881 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 940

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    941 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 1000

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1001 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1060

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1061 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1120

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1121 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1180

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1181 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1240

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1241 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1300

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1301 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1360

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1361 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1420

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1421 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1480

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1481 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1540

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1541 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1600

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1601 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1660

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1661 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1720

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1721 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1780

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1781 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1840

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAA 1773
          |||||||||||||||||||| ||  ||||||||
Db   1841 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAAA 1873



RESULT 5 
US-08-999-723-1 

; Sequence 1, Application US/08999723A
; Patent No. 6025180
; GENERAL INFORMATION:
; APPLICANT: Powell, David J.
; APPLICANT: Southan, Christopher
; APPLICANT: Chapman, Conrad G.
; APPLICANT: Evans, Joanne R.
; TITLE OF INVENTION: ASP1
; FILE REFERENCE: GH70262
; CURRENT APPLICATION NUMBER: US/08/999,723A
; CURRENT FILING DATE: 1997-10-06
; NUMBER OF SEQ ID NOS: 2
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 1
; LENGTH: 1862
; TYPE: DNA
; ORGANISM: Homo sapiens
US-08-999-723-1



Query Match 97.9%; Score 1765.6; DB 3; Length 1862; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1768; Conservative 0; Mismatches 4; Indels 0; Gaps 0;

Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     91 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 150

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    151 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 210

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    211 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 270

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    271 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 330

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    331 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 390

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    391 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 450

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    451 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 510

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    511 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 570

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    571 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 630

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    631 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 690

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    691 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 750

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    751 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 810

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    811 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 870

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    871 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 930

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    931 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 990

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    991 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1050

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1051 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1110

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1111 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1170

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db   1171 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1230

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1231 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1290

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1291 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1350

Qy   1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1351 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1410

Qy   1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1411 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1470

Qy   1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1471 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1530

Qy   1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1531 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1590

Qy   1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1591 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1650

Qy   1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1651 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1710

Qy   1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1711 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1770

Qy   1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1771 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1830

Qy   1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAA 1772
          |||||||||||||||||||| ||  |||||||
Db   1831 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAA 1862



RESULT 6 
US-09-434-427-1 

; Sequence 1, Application US/09434427
; Patent No. 6162630
; GENERAL INFORMATION:
; APPLICANT: POWELL, DAVID J.
; APPLICANT: SOUTHAN, CHRISTOPHER
; APPLICANT: CHAPMAN, CONRAD G.
; APPLICANT: EVANS, JOANNE R.
; TITLE OF INVENTION: ASP1
; FILE REFERENCE: GH-70262-D1
; CURRENT APPLICATION NUMBER: US/09/434,427
; CURRENT FILING DATE: 1999-11-04
; EARLIER APPLICATION NUMBER: US 08/999,723
; EARLIER FILING DATE: 1997-10-06
; EARLIER APPLICATION NUMBER: UK 9626022.9
; EARLIER FILING DATE: 1996-12-14
; NUMBER OF SEQ ID NOS: 2
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 1
; LENGTH: 1862
; TYPE: DNA
; ORGANISM: HOMO SAPIENS
US-09-434-427-1



Query Match 97.9%; Score 1765.6; DB 3; Length 1862; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1768; Conservative 0; Mismatches 4; Indels 0; Gaps 0;



Qy      1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     91 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 150

Qy     61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    151 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 210

Qy    121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    211 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 270

Qy    181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    271 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 330

Qy    241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    331 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 390

Qy    301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    391 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 450

Qy    361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    451 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 510

Qy    421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    511 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 570

Qy    481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    571 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 630

Qy    541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    631 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 690

Qy    601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    691 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 750

Qy    661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    751 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 810

Qy    721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    811 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 870

Qy    781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    871 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 930

Qy    841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    931 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 990

Qy    901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    991 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1050

Qy    961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1051 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1110

Qy   1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1111 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1170

Qy   1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
          ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db   1171 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1230

Qy   1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1231 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1290

Qy   1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1291 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1350



Qy 


1261 


AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGrTrrA 

1 1 1 1 M M 1 1 i 1 1 1 1 f 1 1 1 1 1 1 1 1 ! 1 1 1 1 I I f | | | f M | | I | M 1 1 1 1 1 I I I I | | f M II 

AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 


1 oon 
± Ji U 


Db 


1351 


-L 4 ± U 


Qy 


1321 


GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCrrGrT 

1 1 1 1 M 11 1 1 M 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 I 1 I I I I I I | fl 1 1 

GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 


X o O u 


Db 


1411 


i a n n 


Qy 


1381 


CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGA^r(^TrTr r prr a 

f 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | M 1 1 1 1 1 

CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 


i a a n 

X H. H U 


Db 


1471 


1 R Q A 


Qy 


1441 


GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTrAGP arrrrrrr 

M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I I I 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 


t el n n 


Db 


1531 


1 c: o n 

i o y U 


Qy 


1501 


CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATrAATA 

1 N 1 1 1 1 1 1 1 I 1 1 1 1 1 I I 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 

C GT GAC C CT GAG GT C GT CAAT GAT GAGT C CTCTCTGGT C AGACAT C GC T GGAAAT GAAT A 


IjdU 


Db 


1591 


i a r n 


Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1651 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1710


Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1711 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1770


Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1771 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1830


Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAA 1772
              |||||||||||||||||||| ||  |||||||
Db       1831 CTCCCTACTTCCAAGAAAAAAAAAAAAAAAAA 1862





RESULT 7 
US-09-215-450-1 

; Sequence 1, Application US/09215450
; Patent No. 6635748
; GENERAL INFORMATION:
; APPLICANT: Giese, Klaus
; APPLICANT: Xin, Hong
; TITLE OF INVENTION: METASTATIC BREAST AND COLON CANCER REGULATED GENES
; FILE REFERENCE: 1451.100 / 210030.447
; CURRENT APPLICATION NUMBER: US/09/215,450
; CURRENT FILING DATE: 1998-12-17
; NUMBER OF SEQ ID NOS: 27
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 1
; LENGTH: 2429
; TYPE: DNA
; ORGANISM: human
US-09-215-450-1 



Query Match 82.0%; Score 1478.6; DB 4; Length 2429;

Best Local Similarity 99.4%; Pred. No. 0;

Matches 1484; Conservative 0; Mismatches 9; Indels 0; Gaps 0;



Qy        305 CGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGAACCC 364
              | ||  ||||||||||||||||||||||||||||||||||||||||||||   | |||||
Db        213 CTCACCAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGCAGGAAACCC 272

Qy        365 CGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCCAAGG 424
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        273 CGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCCAAGG 332

Qy        425 GCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAAGACC 484
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        333 GCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAAGACC 392

Qy        485 TCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATTTTTG 544
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        393 TCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATTTTTG 452

Qy        545 AATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCTTATG 604
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        453 AATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCTTATG 512

Qy        605 CCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACACAAG 664
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        513 CCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACACAAG 572

Qy        665 CAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCTGGAT 724
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        573 CAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCTGGAT 632

Qy        725 CTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAAGGAG 784
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        633 CTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAAGGAG 692

Qy        785 ACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAATTGG 844
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        693 ACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAATTGG 752

Qy        845 AAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCCATCG 904
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        753 AAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCCATCG 812

Qy        905 TGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGGAAG 964
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        813 TGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGGAAG 872

Qy        965 CTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAGC 1024
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        873 CTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAGC 932

Qy       1025 TGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATCTACC 1084
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        933 TGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATCTACC 992



Qy       1085 TGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTACATTC 1144
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        993 TGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTACATTC 1052


Qy       1145 AGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCA 1204
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1053 AGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCA 1112


Qy       1205 CAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAG 1264
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1113 CAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAG 1172


Qy       1265 CCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGT 1324
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1173 CCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGT 1232


Qy       1325 CTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGT 1384
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1233 CTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGT 1292


Qy       1385 CTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCA 1444
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1293 CTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCA 1352


Qy       1445 TCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTG 1504
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1353 TCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTG 1412


Qy       1505 ACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATAGCCA 1564
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1413 ACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATAGCCA 1472


Qy       1565 GGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGCAGCA 1624
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1473 GGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGCAGCA 1532


Qy       1625 GCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTC 1684
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1533 GCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTC 1592


Qy       1685 CCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCC 1744
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1593 CCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCC 1652


Qy       1745 CTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAA 1797
              ||||||||||||||||||||||||||||||||||||||||||||||||| | |
Db       1653 CTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAACAGA 1705





RESULT 8 
US-09-717-432-1 

; Sequence 1, Application US/09717432
; Patent No. 6291223
; GENERAL INFORMATION:
; APPLICANT: ZHU, YUAN
; APPLICANT: LI, XIAOTONG
; APPLICANT: CHRISTIE, GARY
; APPLICANT: POWELL, DAVID J.
; TITLE OF INVENTION: Mouse Aspartic Secretase-1 (mASP1)
; FILE REFERENCE: GP-70663
; CURRENT APPLICATION NUMBER: US/09/717,432
; CURRENT FILING DATE: 2000-11-21
; PRIOR APPLICATION NUMBER: 60/166,974
; PRIOR FILING DATE: 1999-11-23
; NUMBER OF SEQ ID NOS: 2
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 1
; LENGTH: 1545
; TYPE: DNA
; ORGANISM: MUS MUSCULUS
US-09-717-432-1 



Query Match 62.9%; Score 1134.2; DB 3; Length 1545; 

Best Local Similarity 83.6%; Pred. No. 5.5e-260; 

Matches 1302; Conservative 0; Mismatches 243; Indels 12; Gaps 1;



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              |||||||| |||   || || ||  |||| |   ||||||| ||||||||| || | ||
Db          1 ATGGGCGCGCTGCTTCGAGCACTCTTGCTCCTGGTGCTGGCGCAGTGGCTCTTGAGTGCG 60


Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              | ||| | ||||||||||||||||||||||||||||||||  ||||||| ||||||||||
Db         61 GTCCCCGCGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCAAGTGGCCGGGGCCACGAAC 120


Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              | |  ||   || |    || |||| ||||||||| |    || ||  ||||| ||  ||
Db        121 CACAGAGCCTCGGCTGTTCCCGGACTCGGGACCCCCGAGTTGCCCCGGGCCGATGGTCTG 180


Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              || ||||| |||||||||| |  |||              | |||||||||||||| |||
Db        181 GCCCTCGCACTGGAGCCTGTCAGGGC------------TACTGCCAACTTCTTGGCTATG 228


Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              || |||||||| ||||||||||||||||||||||||||||| ||||||||||||||||||
Db        229 GTGGACAACCTTCAGGGGGACTCTGGCCGCGGCTACTACCTAGAGATGCTGATCGGGACC 288


Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              || ||||||||| |||||||||| || |||||||||||||||||||| || ||||||||
Db        289 CCTCCGCAGAAGGTACAGATTCTTGTGGACACTGGAAGCAGTAACTTCGCTGTGGCAGGT 348


Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
               |||| ||||||||||||||||| ||||||||| ||||||| || |||||||||| ||||
Db        349 GCCCCACACTCCTACATAGACACCTACTTTGACTCAGAGAGCTCCAGCACATACCACTCC 408


Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||| ||||| |||||||||||||| ||||||||||| ||||| ||||| ||
Db        409 AAGGGCTTTGATGTCACTGTGAAGTACACACAGGGAAGCTGGACTGGCTTTGTTGGTGAG 468


Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||| ||||||||||| ||||||||||| |  |||||  | ||||| ||||||||||||
Db        469 GACCTTGTCACCATCCCAAAAGGCTTCAACAGCTCTTTCTTGGTCAATATTGCCACTATT 528


Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              || || || |||||||||||||||||||| ||||||||||||||||| ||||| || |||
Db        529 TTCGAGTCTGAGAATTTCTTTTTGCCTGGTATTAAATGGAATGGAATCCTTGGACTTGCT 588



Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              |||||  |  | |||||||||||||| ||||||||||| || || || ||||||||| |
Db        589 TATGCTGCTTTGGCCAAGCCATCAAGCTCTCTGGAGACATTTTTTGATTCCCTGGTGGCC 648

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              |||||||| || ||  || ||||||||||||||||||| || || || ||||| || |||
Db        649 CAAGCAAAGATTCCAGACATTTTCTCCATGCAGATGTGCGGGGCTGGATTGCCAGTAGCT 708

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              || ||||| ||||||||||||||||||||| ||||||| |||||||||||||||||||||
Db        709 GGTTCTGGTACCAACGGAGGTAGTCTTGTCCTGGGTGGGATTGAACCAAGTTTGTATAAA 768

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||| |||||||||||||| ||||| || || |||||||| || |||||||| |||||
Db        769 GGAGATATCTGGTATACCCCAATTAAAGAGGAATGGTACTATCAAATAGAAATCCTGAAG 828

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||| | ||| || |||||||||||||||||||||||||||||||||
Db        829 TTGGAAATTGGAGGCCAGAACCTCAACCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 888

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              || |||||||||||||||||||| ||||||||||||||||||||||||||||| ||||||
Db        889 ATTGTGGACAGTGGCACCACGCTCCTGCGCCTGCCCCAGAAGGTGTTTGATGCAGTGGTG 948

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||| ||  |||||||||||||||| || |||||||| |||||||| ||| ||
Db        949 GAAGCTGTGGCACGAACATCTCTGATTCCAGAGTTTTCTGATGGCTTCTGGACAGGGGCC 1008

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              |||||||| |||||||| ||||| ||||| || ||| | || |||||||| || || |||
Db       1009 CAGCTGGCATGCTGGACAAATTCTGAAACGCCATGGGCATATTTCCCTAAGATTTCTATC 1068

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              |||||||||||||||||  ||||  | || ||||| ||||| || || || ||||| |||
Db       1069 TACCTGAGAGATGAGAATGCCAGTCGCTCCTTCCGGATCACCATTCTCCCACAGCTCTAC 1128

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||| || ||  | ||||||||||| ||||| || || || ||| |
Db       1129 ATTCAGCCCATGATGGGAGCTGGTTTCAATTATGAATGCTACCGTTTTGGTATCTCCTCT 1188

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              |||||||||||||||||||| ||||| || |||||||| |||||||||||  |||| |||
Db       1189 TCCACAAATGCGCTGGTGATTGGTGCGACCGTGATGGAAGGCTTCTACGTGGTCTTTGAC 1248

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||| |||| |||||||||||| |||| ||| ||||||||||| |||| |||| |  ||
Db       1249 AGAGCTCAGAGGAGGGTGGGCTTTGCAGTGAGTCCCTGTGCAGAGATTGAAGGTACCACA 1308

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              |||||||||||||| ||||| || || || || ||  |||||||||||||||| || ||
Db       1309 GTGTCTGAAATTTCTGGGCCCTTTTCCACGGAAGACATAGCCAGCAACTGTGTTCCAGCA 1368

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||| || |||  |||||||| |||||||||||||||||||| || ||||| || ||||||
Db       1369 CAGGCTCTGAATGAGCCCATCTTGTGGATTGTGTCCTATGCCCTGATGAGTGTGTGTGGA 1428

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||| ||||| ||  | ||| |||| |||||||| ||| | |  || |  | |  ||||
Db       1429 GCCATTCTCCTGGTTCTGATCCTCCTCCTGCTGCTCCCGCTGCACTGCCGTCATGCCCCC 1488

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGA 1557
              || ||||||||||| || || ||||||||||| || |||||||||||||||||||||
Db       1489 CGAGACCCTGAGGTAGTTAACGATGAGTCCTCACTAGTCAGACATCGCTGGAAATGA 1545



RESULT 9 
US-09-912-484-1 

; Sequence 1, Application US/09912484
; Patent No. 6358725
; GENERAL INFORMATION:
; APPLICANT: Christie, Gary
; APPLICANT: Li, Xiaotong
; APPLICANT: Powell, David J.
; APPLICANT: Zhu, Yuan
; TITLE OF INVENTION: Mouse Aspartic Secretase-1 (mASP1)
; FILE REFERENCE: GP-70663-D1
; CURRENT APPLICATION NUMBER: US/09/912,484
; CURRENT FILING DATE: 2001-07-25
; PRIOR APPLICATION NUMBER: 60/166,974
; PRIOR FILING DATE: 1999-11-23
; PRIOR APPLICATION NUMBER: 09/717,432
; PRIOR FILING DATE: 2000-11-21
; NUMBER OF SEQ ID NOS: 2
; SOFTWARE: FastSEQ for Windows Version 3.0
; SEQ ID NO 1
; LENGTH: 1545
; TYPE: DNA
; ORGANISM: MUS MUSCULUS
US-09-912-484-1 



Query Match 62.9%; Score 1134.2; DB 4; Length 1545; 

Best Local Similarity 83.6%; Pred. No. 5.5e-260; 

Matches 1302; Conservative 0; Mismatches 243; Indels 12; Gaps 1; 



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              |||||||| |||   || || ||  |||| |   ||||||| ||||||||| || | ||
Db          1 ATGGGCGCGCTGCTTCGAGCACTCTTGCTCCTGGTGCTGGCGCAGTGGCTCTTGAGTGCG 60

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              | ||| | ||||||||||||||||||||||||||||||||  ||||||| ||||||||||
Db         61 GTCCCCGCGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCAAGTGGCCGGGGCCACGAAC 120

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              | |  ||   || |    || |||| ||||||||| |    || ||  ||||| ||  ||
Db        121 CACAGAGCCTCGGCTGTTCCCGGACTCGGGACCCCCGAGTTGCCCCGGGCCGATGGTCTG 180

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              || ||||| |||||||||| |  |||              | |||||||||||||| |||
Db        181 GCCCTCGCACTGGAGCCTGTCAGGGC------------TACTGCCAACTTCTTGGCTATG 228

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              || |||||||| ||||||||||||||||||||||||||||| ||||||||||||||||||
Db        229 GTGGACAACCTTCAGGGGGACTCTGGCCGCGGCTACTACCTAGAGATGCTGATCGGGACC 288



Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              || ||||||||| |||||||||| || |||||||||||||||||||| || ||||||||
Db        289 CCTCCGCAGAAGGTACAGATTCTTGTGGACACTGGAAGCAGTAACTTCGCTGTGGCAGGT 348

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
               |||| ||||||||||||||||| ||||||||| ||||||| || |||||||||| ||||
Db        349 GCCCCACACTCCTACATAGACACCTACTTTGACTCAGAGAGCTCCAGCACATACCACTCC 408

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||| ||||| |||||||||||||| ||||||||||| ||||| ||||| ||
Db        409 AAGGGCTTTGATGTCACTGTGAAGTACACACAGGGAAGCTGGACTGGCTTTGTTGGTGAG 468

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||| ||||||||||| ||||||||||| |  |||||  | ||||| ||||||||||||
Db        469 GACCTTGTCACCATCCCAAAAGGCTTCAACAGCTCTTTCTTGGTCAATATTGCCACTATT 528

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              || || || |||||||||||||||||||| ||||||||||||||||| ||||| || |||
Db        529 TTCGAGTCTGAGAATTTCTTTTTGCCTGGTATTAAATGGAATGGAATCCTTGGACTTGCT 588

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              |||||  |  | |||||||||||||| ||||||||||| || || || ||||||||| |
Db        589 TATGCTGCTTTGGCCAAGCCATCAAGCTCTCTGGAGACATTTTTTGATTCCCTGGTGGCC 648

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              |||||||| || ||  || ||||||||||||||||||| || || || ||||| || |||
Db        649 CAAGCAAAGATTCCAGACATTTTCTCCATGCAGATGTGCGGGGCTGGATTGCCAGTAGCT 708

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              || ||||| ||||||||||||||||||||| ||||||| |||||||||||||||||||||
Db        709 GGTTCTGGTACCAACGGAGGTAGTCTTGTCCTGGGTGGGATTGAACCAAGTTTGTATAAA 768

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||| |||||||||||||| ||||| || || |||||||| || |||||||| |||||
Db        769 GGAGATATCTGGTATACCCCAATTAAAGAGGAATGGTACTATCAAATAGAAATCCTGAAG 828

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||| | ||| || |||||||||||||||||||||||||||||||||
Db        829 TTGGAAATTGGAGGCCAGAACCTCAACCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 888

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              || |||||||||||||||||||| ||||||||||||||||||||||||||||| ||||||
Db        889 ATTGTGGACAGTGGCACCACGCTCCTGCGCCTGCCCCAGAAGGTGTTTGATGCAGTGGTG 948

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||| ||  |||||||||||||||| || |||||||| |||||||| ||| ||
Db        949 GAAGCTGTGGCACGAACATCTCTGATTCCAGAGTTTTCTGATGGCTTCTGGACAGGGGCC 1008

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              |||||||| |||||||| ||||| ||||| || ||| | || |||||||| || || |||
Db       1009 CAGCTGGCATGCTGGACAAATTCTGAAACGCCATGGGCATATTTCCCTAAGATTTCTATC 1068

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              |||||||||||||||||  ||||  | || ||||| ||||| || || || ||||| |||
Db       1069 TACCTGAGAGATGAGAATGCCAGTCGCTCCTTCCGGATCACCATTCTCCCACAGCTCTAC 1128



Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||| || ||  | ||||||||||| ||||| || || || ||| |
Db       1129 ATTCAGCCCATGATGGGAGCTGGTTTCAATTATGAATGCTACCGTTTTGGTATCTCCTCT 1188


Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              |||||||||||||||||||| ||||| || |||||||| |||||||||||  |||| |||
Db       1189 TCCACAAATGCGCTGGTGATTGGTGCGACCGTGATGGAAGGCTTCTACGTGGTCTTTGAC 1248


Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||| |||| |||||||||||| |||| ||| ||||||||||| |||| |||| |  ||
Db       1249 AGAGCTCAGAGGAGGGTGGGCTTTGCAGTGAGTCCCTGTGCAGAGATTGAAGGTACCACA 1308


Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              |||||||||||||| ||||| || || || || ||  |||||||||||||||| || ||
Db       1309 GTGTCTGAAATTTCTGGGCCCTTTTCCACGGAAGACATAGCCAGCAACTGTGTTCCAGCA 1368


Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||| || |||  |||||||| |||||||||||||||||||| || ||||| || ||||||
Db       1369 CAGGCTCTGAATGAGCCCATCTTGTGGATTGTGTCCTATGCCCTGATGAGTGTGTGTGGA 1428


Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||| ||||| ||  | ||| |||| |||||||| ||| | |  || |  | |  ||||
Db       1429 GCCATTCTCCTGGTTCTGATCCTCCTCCTGCTGCTCCCGCTGCACTGCCGTCATGCCCCC 1488


Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGA 1557
              || ||||||||||| || || ||||||||||| || |||||||||||||||||||||
Db       1489 CGAGACCCTGAGGTAGTTAACGATGAGTCCTCACTAGTCAGACATCGCTGGAAATGA 1545



RESULT 10 
US-09-280-116-32 

; Sequence 32, Application US/09280116A
; Patent No. 6331427
; GENERAL INFORMATION:
; APPLICANT: Robison, Keith E.
; TITLE OF INVENTION: Nucleic Acid Molecules Encoding Human Protease Homologs
; FILE REFERENCE: 5800-24, 035800/176965
; CURRENT APPLICATION NUMBER: US/09/280,116A
; CURRENT FILING DATE: 1999-03-26
; NUMBER OF SEQ ID NOS: 268
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 32
; LENGTH: 2514
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; OTHER INFORMATION: aspartyl proteases
US-09-280-116-32 

Query Match 61.2%; Score 1104.6; DB 4; Length 2514; 

Best Local Similarity 92.9%; Pred. No. 7.1e-253; 

Matches 1428; Conservative 0; Mismatches 64; Indels 45; Gaps 24; 

Qy        305 CGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGAACCC 364
              | ||  ||||||||||||||||||||||||||||||||||||||||||||| ||||||||
Db        213 CTCACCAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGG-AGGAACCC 271



Qy 


365 


C GC AC T C CT ACAT AGACAC GT ACT T T GAC AC AGAGAG GT C T AGCAC AT AC C G C T C CAAGG 

llll III 1111 llllllltl lilll 111 II mi ■■ ■■■ Lb.. 

1 1 1 I 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 I 1 1 1 t M 1 It 1 t 1 1 1 M 1 I 1 1 I | || | | M | | | | | | | | | 

C GC ACT C CT AC AT AGACAC GT ACTT T GAC AC AGAGAG GT C T AGCAC AT AC C G CT C CAAG G 


424 


Db 


272 


331 


Qy 


425 


GCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAAGACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 

GCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAAGACC 


484 


Db 


332 


391 


Qy 


485 


TCGTCACCATCCCCAAAGGCTTC-AATACTTCTTTTCTTGTCAACATTGCCACTATTTTT 
1 1 1 M 1 1 1 1 1 1 1 1 1 I 1 1 | | | | | | M | | | | | | | | | | | | | | | M | | | | || | | | j | || | | | | 

TCGTCACCATCCCCAAAGGCTTCAAATACTTCTTTTCTTGTCAACATTGCCACTATTTTT 


543 


Db 


392 


451 


Qy 


544 


GAAT C AGA- GAAT TT C TTTTTGCCT G GGAT TAAAT GGAAT G GAAT ACT T GGC CT AGC T T A 

N 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 
GAAT CAGAG GAAT TT CTTTTTGCCT G GGAT TAAAT G GAAT G GAAT ACT T GGC CT AGCT T A 


602 


Db 


452 


511 


Qy 


603 


TGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTG AC 

1 1 1 M 1 1 1 M 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
TGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGAACAC 


659 


Db 


512 


571 


Qy 


660 


ACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCC-GGCTTGCCCGTTG 
1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 
AAGCAAAACATCCCCAAACGTTTTCTCCATGCAGATGTGTGGAGCCGGGCTTGCCCGTTG 


718 


Db 


572 


631 


Qy 


719 


CTGG ATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTT 
1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 I II I I I | | | | | | | | | | | 
GCTGGGATTCTGGGGAACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTT 


773 


Db 


632 


691 


Qy 


774 


GT ATAAAG GAGAC AT C T GGT AT AC C C CT AT T AAG GAAGAGT G GT ACT AC C AGAT AGAAAT 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 i 1 I I I I I I I I || | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

GT ATAAAGGAGAC AT CT G GT AT AC C C CT AT T AAGGAAGAGT GGT ACT AC C AGAT AGAAAT 


833 


Db 


692 


751 


Qy 


834 


TCTGAAATT G GAAAT T GGAG GC CAAAG C CT T AAT CT G GACT G C AGAGAGT A 

Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llll 
T CT T GAAAT T GGGAAATTGGGAGGCCAAAGCTTT AAT TCTTGGGACTGCAGGAGGAGT AT 


884 


Db 


752 


811 


Qy 


885 


TAAC GC AGACAAGG — CCATCGTGGACAGT— GGCACCACGCTGCTGCGCCT — GCCCCAG 

M 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I I I | | | | | | | | | | || I I I I I I 

TAACGCAGACAAGGGCCATCGTTGGACAGTGGGCACCACGCTGCTGCGCCTTGCCCCCAG 


939 


Db 


812 


871 


Qy 


940 


AAGGTGTTTGATGCGGTGGT-GGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTC 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

AAGGTGTTTGATGCGGTGGTGGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTC 


998 


Db 


872 


931 


Qy 


999 


TGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTC 

1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTC 


1058 


Db 


932 


991 


Qy 


1059 


T T ACT T C C C T AAAAT C T CC AT CT AC CT GAGAGAT GAGAACT C CAGC AGGT C A T T C r GT AT 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

T T ACT T C C CT AAAAT C T CC AT C T ACCT GAG AGAC GAGAACT C CAGCAG GT CAT T C C GT AT 


1 1 1 Q 
IliO 


Db 


992 


1051 


Qy 


1119 


CAC AAT C C T G C C T CAGCTT T ACAT T C AGC C CAT GAT GGGGGCCGGCCT GAAT TAT GAAT G 

1 1 M 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I | I | I I | | | | | | | | | || | | | | | | | | | | | | | | | | | | || | | | 

CACAATCCTGCCTCAGCTTTACATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATG 


1178 


Db 


1052 


llll 



Qy 



117 9 TTACCGATTCGGCATTTCCCCATCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGA 1238 



Db 


1112 


1 M 1 1 1 M 1 ! f 1 1 1 1 1 1 ! 1 I 1 1 f If I I 1 1 1 1 1 1 I 1 1 1 I ! 1 1 1 I 1 1 I f M i 1 1 f I I 1 I I I I 
T T AC C GAT T C GGC AT T T C C C CAT C CACAAAT GC GCT GGT GAT C G GT GC CAC GGT GAT G GA 


1171 


Qy 


1239 


GGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTG 
1 1 1 M II 1 I i 1 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | | | M f | | | | | | | | j | | | | | | | | | 

GGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTG 


1298 


Db 


1172 


1231 


Qy 


1299 


TGCAGAAATTGCA-GGTGCTGC-AGTGTCTGAAATTTCCGGGCCTTTCTC-AACAGAGGA 
M 1 M 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I | | | | | | | M 1 1 1 1 1 1 1 1 1 1 
TGCAGAAATTGCACGGTGCTGCAAGTGTCTGAAATTTCCGGGCCTTTCTCAAACAGAGGA 


1355 


Db 


1232 


1291 


Qy 


1356 


TGTAGCCAG-CAACTGTGTCCCCGCTC-AGTCTTTGAGCGA-GCCCATTTTGTGGATTGT 
1 1 M 1 1 M 1 1 1 1 1 I 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II I | || | M 1 1 1 II 1 
TGTAGCCAGCCAACTGTGTCCCCGCTCAAGTCTTTGAGCGACGCCCATTTTGTGGATTGT 


1412 


Db 


1292 


1351 


Qy 


1413 


GTCC TATGCGCTCATGAGCGTCTGTGG-AGCCATCCTCCTTGTC-TTAATCGTCCT 

Nil M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 I I I | | | | | | | | | | | | 
GTCCCTATGCCGCTCAATGAAGCGTCTGTGGAAGCCATCCTCCTTGTCGTTAATTCAGTC 


1466 


Db 


1352 


1411 


Qy 


1467 


GCTGCTGC TGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | 
GCTGCTGCTGCTTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATG 


1522 


Db 


1412 


1471 


Qy 


1523 


AT GAGT CCTCTCT— G GT C AGAC AT C GCT GGAAAT GAATAGC CAGGC C T GAC C T CAAGCAA 

1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AT GAGT CCTCTCT GG GT C AGAC AT C GCT GGAAAT GAATAGC CAGGC C T GAC C T CAAGCAA 


1581 


Db 


1472 


1531 


Qy 


1582 


C CAT GAACT C AGC TAT TAAGAAAAT CACATT T C C AG GGC AGCAGC C GGGAT C GAT G GT GG 

1 1 1 1 1 M 1 1 1 1 1 1 1 I II 1 1 1 I 1 1 1 1 1 1 II 1 1 1 1 1 IE 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I M 

CCATGAACTCAGCTATTAAGAAT^TCACATTTCCAGGGCAGCAGCCGGGATCGATGGTGG 


1641 


Db 


1532 


1591 


Qy 


1642 


CGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTCCCAGATGCCTTCTAGAT 
1 M 1 1 1 1 1 1 II 1 I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

CGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTCCCAGATGCCTTCTAGAT 


1701 


Db 


1592 


1651 


Qy


1702


i L/i^iijiti Hi iLii (jAI ill LAA.bC- 111 CAAATCCTCCCTACTTCCAAGAAAAA 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | | | | | | | | I | | | | | | | | | | | | | | | 

TCACTGTCTTTTGATTCTTGATTTTCAAGCTTTT CAAATCCTCCCTACTTCCAAGAAAAA 


1760 


Db 


1652 


1711 


Qy 


1761 


TAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAA 1797

1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 

TAATTAAAAAAAAAACTTCATTCTAAACCAAAACAGA 1748




Db 


1712 





RESULT 11 
US-09-280-116-85 

; Sequence 85, Application US/09280116A 
; Patent No. 6331427 
; GENERAL INFORMATION: 

APPLICANT: Robison, Keith E. 
; TITLE OF INVENTION: Nucleic Acid Molecules Encoding Human Protease Homologs 

FILE REFERENCE: 5800-24, 035800/176965

CURRENT APPLICATION NUMBER: US/09/280,116A

CURRENT FILING DATE: 1999-03-26

NUMBER OF SEQ ID NOS: 268

SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 85 



LENGTH: 1021 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:

OTHER INFORMATION: aspartyl proteases
FEATURE:

NAME/KEY: misc_feature
LOCATION: (1)..(1021)

OTHER INFORMATION: n = a, t, c or g
US-09-280-116-85 

Query Match 52.9%; Score 953.6; DB 4; Length 1021; 

Best Local Similarity 98.8%; Pred. No. 3.7e-217; 

Matches 971; Conservative 0; Mismatches 10; Indels 2; Gaps 1; 

Qy 98 TCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTG 157

I I I I I I I M I i I I I I I I I I I i I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I 
Db 38 TCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGAGCCCTG 97 

Qy 158 CCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 217 

I I M I I I I I M I I I I I I I I I I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I | | I I I 
Db 98 CCGAGCGCCACGCCGACGGCTNGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 157 

Qy 218 GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACT 277 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I i I I I I i I I I I M I I I I I I I I I I 
Db 158 GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACT 217 

Qy 278 ACCTGGAGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAA 337

I I I I I I I I I I I I I I I I I I I i I I I I I I It I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 218 ACCTGGAGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAA 277

Qy 338 GCAGTAACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAG 397

I I I I I I I i I I I I I I I I I I I I i I II I I I I I M I I I I I I I I I I I I I I I I I i I ! I I I I I I I I I 
Db 278 GCAGTAACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAG 337

Qy 398 AGAGGTCTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAA 457

! I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 338 AGAGGTCTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAA 397

Qy 458 GCTGGACGGGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTT 517

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I i I I I I I M M I I I I I I I 
Db 398 GCTGGACGGGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTT 457

Qy 518 TTCTTGTCAACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAAT 577 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I ! I I I I I I I I I I I M I I I I I II I I I 
Db 458 TTCTTGTCAACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAAT 517

Qy 578 GGAATGGAATACTTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGA 637

I M I I I I I I I II I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 518 GGAATGGAATACTTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGA 577 

Qy 638 CCTTCTTCGACTCCCTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGT 697

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I M I I I I I I I I I I 
Db 578 CCTTCTTCGACTCCCTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGT 637 



Qy 698 GTGGAGCCGGCTTGCCCGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTG 757
I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 



Db 638 GTGGAGCCGGCTTGCCCGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTG 697 

Qy 758 GAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGT 817

I I I M I I I I f I I I I I I I I I I I I I I I i I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I | I I I 
Db 698 GAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGT 757

Qy 818 ACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCA 877

I I I I I I I I I I I I I i I I I I I I I I t I I I I I I I I I I II I II I I I I I I I t I I I I I I I I I I | I I I 
Db 758 ACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCA 817

Qy 878 GAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCC 937

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | M 
Db 818 GAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCC 877 

Qy 938 AGAAGGTGTTTGATGC--GGTGGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATT 995

I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 878 AGAAGGTGTTTGATGCCGGTGGTGGAAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATT 937

Qy 996 CTCTGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTTG 1055 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I M I I I I I I I I I | I | I 
Db 938 CTCTGATGGTTTCTGGACTGGGTCCCACTTGGCGTGCTGGACGAATTCGGAAACACCTTG 997

Qy 1056 GTCTTACTTCCCTAAAATCTCCA 1078

I II I I I I I I I I I I I I I I I I I II 
Db 998 GTCTTACTTCCCTAAAATCTTCA 1020
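
The percentages on the summary lines of RESULT 11 can be reproduced from the other figures printed in this report. The sketch below assumes, as an inference from the printed numbers and not from GenCore documentation, that Query Match is the Score divided by the Perfect score (1804 for this query), and that Best Local Similarity is Matches divided by the aligned length (Matches + Mismatches + Indels). The function names are hypothetical.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect (self-match) score.

    Assumption: this ratio is inferred from the reported values.
    """
    return 100.0 * score / perfect_score


def best_local_similarity_pct(matches: int, mismatches: int, indels: int) -> float:
    """Matches as a percentage of all aligned columns.

    Assumption: aligned length = matches + mismatches + indels.
    """
    aligned_columns = matches + mismatches + indels
    return 100.0 * matches / aligned_columns


# Values copied from the RESULT 11 summary lines (Perfect score: 1804).
print(round(query_match_pct(953.6, 1804), 1))
print(round(best_local_similarity_pct(971, 10, 2), 1))
```

Under these assumptions the same two functions also reproduce the figures reported for RESULT 12 (Score 397.4; Matches 863, Mismatches 676, Indels 30) and RESULT 15 (Score 395; Matches 805, Mismatches 630, Indels 12).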

RESULT 12 

US-09-724-566A-42

; Sequence 42, Application US/09724566A 
; Patent No. 6627739 
; GENERAL INFORMATION: 



APPLICANT: Anderson, John P.
APPLICANT: Basi, Guriqbal
APPLICANT: Doane, Minh Tam
APPLICANT: Frigon, Normand
APPLICANT: John, Varghese
APPLICANT: Power, Michael
APPLICANT: Sinha, Sukanto
APPLICANT: Tatsuno, Gwen
APPLICANT: Tung, Jay
APPLICANT: Wang, Shuwen
APPLICANT: McConlogue, Lisa



; TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2 

CURRENT APPLICATION NUMBER: US/09/724,566A

CURRENT FILING DATE: 2000-11-28 

PRIOR APPLICATION NUMBER: US 09/501,708 
; PRIOR FILING DATE: 2000-02-10 

PRIOR APPLICATION NUMBER: 60/119,571 

PRIOR FILING DATE: 1999-02-10 

PRIOR APPLICATION NUMBER: 60/139,172 

PRIOR FILING DATE: 1999-06-15 

NUMBER OF SEQ ID NOS : 104 
; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 42 



; LENGTH: 2348

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-09-724-566A-42 

Query Match 22.0%; Score 397.4; DB 4; Length 2348; 

Best Local Similarity 55.0%; Pred. No. 7.1e-85; 

Matches 863; Conservative 0; Mismatches 676; Indels 30; Gaps 3; 
Qy 2 TGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCCG 61 

MM Ml MINI I Ml II I II II II M II I II 

Db 238 TGGGGGCAGGCGCCAGGGACGGACGTGGGCCAGTGCGAGCCCAGAGGGCCCGAAGGCCGG 297

Qy 62 CCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAACC 121 

M M I I II III I I II I II I I M I I I I I I | | 

Db 298 GGCCCACCATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGC 357

Qy 122 GCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTGG 181 

I I M III III II III I I II 

Db 358 CTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCC 417 

Qy 182 CGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGC 224

III! Ml II II I I I I I I II I Ml 
Db 418 CCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGG 477 

Qy 225 -CAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGG 283 

Mill M I I I I I I II I I I I II I Ml I I I I I I I I I I I I I I I I || 
Db 478 GCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGG 537 

Qy 284 AGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTA 343

M I I I I II i I II I II M I I I II I I II I I I I I I II II I I I I I I I 

Db 538 AGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTA 597

Qy 344 ACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 403

M M I M II I I I II II I II I I I I I I || I I || || || 

Db 598 ACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGT 657 

Qy 404 CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGA 463

I M I II II I I II I I I || Ml | | M I I I M I III 

Db 658 CCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGG 717 

Qy 464 CGGGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTG 523

II EM I I I II I I I I I I I I II I i I I I I I I I I I I I 

Db 718 AAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTG 777 

Qy 524 TCAACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATG 583

M II II II I III || || | | | || | | | | || | | M II I I I I I 

Db 778 CCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAG 837

Qy 584 GAATACTTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCT 643

I M I I II II II II I M I I I I II I II I I II I II I I I I I I 

Db 838 GCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCT 897

Qy 644 TCGACTCCCTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAG 703

I I M I I I I I II I II I I I I I I I | M | | || | || || | | | | || | | | | 
Db 898 TTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTG 957 



Qy 704 CCGGCTTGCCCGT TGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGG 754

I Mill Ml I II I I | I II I I I II I I | | I 

Db 958 CTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTG 1017 

Qy 755 GTGGAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGT 814

I M li II I I I I I I I II II I I I I I I I I I I I | | | | | | 

Db 1018 GAGGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGT 1077

Qy 815 GGTACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACT 874

Mill! I I I I I I II I II I I I I I I | | | | | | | | | | | 

Db 1078 GGTATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACT 1137

Qy 875 GCAGAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGC 934

III M II I I I I I I II II I I I I I I I I I I I I I I I I I I I I | | | | If | 

Db 1138 GCAAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGC 1197

Qy 935 CCCAGAAGGTGTTTGATGCGGTGGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAAT 994 

I I M I I I I I I I f II II I II III I | | | | | || 

Db 1198 CCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGT 1257

Qy 995 TCTCTGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTT 1054

M II M I I I I I I I I I I || || | M I I II I II I I I I I I II 

Db 1258 TCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTT 1317 

Qy 1055 GGTCTTACTTCCCTAAAATCTCCATCTACCTGAGAGATGAGAACTCCAGCAGGTCATTCC 1114

M Mill M I M I I I I I I I I I I I I I | | | | I | | | | | I 

Db 1318 GGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCC 1377

Qy 1115 GTATCACAATCCTGCCTCAGCTTTACATTCAGCCCATGATGGGGGCCGGCCTG AATT 1171

I M I I I I I II I I I I I II III I I III || | | || | | 
Db 1378 GCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAG 1437

Qy 1172 ATGAATGTTACCGATTCGGCATTTCCCCATCCACAAATGCGCTGGTGATCGGTGCCACGG 1231

I I I M I I I I I I I I I I I I I || || | || || | | || 

Db 1438 ACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTA 1497

Qy 1232 TGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGA 1291

I I M II I II I I I M II I I I I II I I I I I I I || | | | M I I I I I I 
Db 1498 TCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCA 1557

Qy 1292 GCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTTCCGGGCCTTTCTCAACAG 1351 

I I I M I II III I || | | | | | | | 

Db 1558 GCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCT 1617 

Qy 1352 AGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAGCCCATTTTGTGGATTG 1411 

I I I I I I I I I I I I II 

Db 1618 TGGACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCA 1677

Qy 1412 TGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTTAATCGTCCTGCTGC 1471 

i M I M I I I I I I I I I I I I M I I I I I I I I 

Db 1678 TAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGT 1737 

Qy 1472 TGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATGATGAGTCCT 1531

II I II I II III I I I II I I I I I II I I II 

Db 1738 GTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCT 1797 

Qy 1532 CTCTGGTCA 1540



Db 1798 CCCTGCTGA 1806



RESULT 13 
US-09-724-566A-44

Sequence 44, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 



APPLICANT: Anderson, John P.
APPLICANT: Basi, Guriqbal
APPLICANT: Doane, Minh Tam
APPLICANT: Frigon, Normand
APPLICANT: John, Varghese
APPLICANT: Power, Michael
APPLICANT: Sinha, Sukanto
APPLICANT: Tatsuno, Gwen
APPLICANT: Tung, Jay
APPLICANT: Wang, Shuwen
APPLICANT: McConlogue, Lisa
TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 
TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2

CURRENT APPLICATION NUMBER: US/09/724,566A
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 44 
LENGTH: 2348 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-724-566A-44

Query Match 22.0%; Score 397.4; DB 4; Length 2348; 

Best Local Similarity 55.0%; Pred. No. 7.1e-85; 

Matches 863; Conservative 0; Mismatches 676; Indels 30; Gaps 3; 

Qy 2 TGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCCG 61

I I I I Ml I I t I I I I Ml Ml I M I I I I I II I II 

Db 238 TGGGGGCAGGCGCCAGGGACGGACGTGGGCCAGTGCGAGCCCAGAGGGCCCGAAGGCCGG 297

Qy 62 CCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAACC 121

II I I I I I I III I I I I I || I I M I I | | | | | | 

Db 298 GGCCCACCATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGC 357 

Qy 122 GCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTGG 181 

I I II IN IN II III I I I I 

Db 358 CTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCC 417 

Qy 182 CGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGC 224 

I I I I M I II I I I I I I I I || | Ml 



Db 418 CCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGG 477

Qy 225 -CAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGG 283

| | | | | | | | I I I 1 I I I I I I I I I I Ml I M I 1 t I I I I 11 I It I II

Db 478 GCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGG 537

Qy 284 AGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTA 343

Mill | || | || I I I I I 1 IN N I I It

Db 538 AGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTA 597

Qy 344 ACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 403

| | | 1 | | | | | | I | I I I I I I I I I I I I I N I I II I I II

Db 598 ACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGT 657

Qy 404 CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGA 463

II I I I I I I I I I I I I I I I I I I I I I I Ml

Db 658 CCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGG 717

Qy 464 CGGGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTG 523

II IN I I I It M I I M I I M I I I I Ml t I I I II

Db 718 AAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTG 777

Qy 524 TCAACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATG 583

| | | M II II III I I M I II I I I I M M I I M I I I M I I

Db 778 CCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAG 837

Qy 584 GAATACTTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCT 643

I || | M MUM III M I I I I I I I MM

Db 838 GCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCT 897

Qy 644 TCGACTCCCTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAG 703

I I M Ml I I I M I I I M M I I I II I 1 Mil

Db 898 TTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTG 957

Qy 704 CCGGCTTGCCCGT TGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGG 754

| M I 1 M I I t I I I II I M I Mil

Db 958 CTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTG 1017

Qy 755 GTGGAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGT 814

I || || || I I I M I I I 1 M I I M II II II II I MM

Db 1018 GAGGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGT 1077

Qy 815 GGTACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACT 874

| 1 I I Mill I I I I II I I M MM MUM

Db 1078 GGTATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACT 1137

Qy 875 GCAGAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGC 934

Ml | | | I I II I II II II I II II It It II M II M II I MM Mt

Db 1138 GCAAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGC 1197

Qy 935 CCCAGAAGGTGTTTGATGCGGTGGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAAT 994

|| I I M M II 1 1 II M I I I Ml I MM I II

Db 1198 CCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGT 1257

Qy 995 TCTCTGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTT 1054

II || I I M M II II M M M I I II I I M II II MMM

Db 1258 TCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTT 1317



Qy 1055 GGTCTTACTTCCCTAAAATCTCCATCTACCTGAGAGATGAGAACTCCAGCAGGTCATTCC 1114

M I | | | I I I I M MINI! I I I M I Mil I I I I I I I 

Db 1318 GGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCC 1377 

Qy 1115 GTATCACAATCCTGCCTCAGCTTTACATTCAGCCCATGATGGGGGCCGGCCTG AATT 1171 

I I I I I I I 1 I I I I 1 I I M M I M I 

Db 1378 GCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAG 1437 

Qy 1172 ATGAATGTTACCGATTCGGCATTTCCCCATCCACAAATGCGCTGGTGATCGGTGCCACGG 1231 

| | | | | | | | I II I M It I I "I " I" 

Db 1438 ACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTA 1497

Qy 1232 TGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGA 1291

| | I I I I ! I I I I I I I I I M I I 1 I I I I ! I I I ! I I 

Db 1498 TCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCA 1557 

Qy 1292 GCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTTCCGGGCCTTTCTCAACAG 1351

| | | M I It I I I I It Mill M 

Db 1558 GCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCT 1617

Qy 1352 AGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAGCCCATTTTGTGGATTG 1411

I I I I II 

Db 1618 TGGACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCA 1677

Qy 1412 TGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTTAATCGTCCTGCTGC 1471 

| | | | | M | I II II I I I I I I I 

Db 1678 TAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGT 1737

Qy 1472 TGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATGATGAGTCCT 1531

| | | | | | MM II 

Db 1738 GTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCT 1797 

Qy 1532 CTCTGGTCA 1540 

I III I I 
Db 1798 CCCTGCTGA 1806



RESULT 14 
US-09-724-566A-48

Sequence 48, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 



APPLICANT: Anderson, John P.
APPLICANT: Basi, Guriqbal
APPLICANT: Doane, Minh Tam
APPLICANT: Frigon, Normand
APPLICANT: John, Varghese
APPLICANT: Power, Michael
APPLICANT: Sinha, Sukanto
APPLICANT: Tatsuno, Gwen
APPLICANT: Tung, Jay
APPLICANT: Wang, Shuwen
APPLICANT: McConlogue, Lisa
TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and
TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2



CURRENT APPLICATION NUMBER: US/09/724,566A
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 48 
LENGTH: 16080 
TYPE: DNA

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Expression Vector pCEK 
FEATURE : 

NAME/KEY: misc_feature 
LOCATION: (1) . . . (16080) 
OTHER INFORMATION: n = A,T,C or G 
US-09-724-566A-48

Query Match 22.0%; Score 397.4; DB 4; Length 16080; 

Best Local Similarity 55.0%; Pred. No. 1.5e-84; 

Matches 863; Conservative 0; Mismatches 676; Indels 30; Gaps 3; 

Qy 2 TGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCCG 61 

I | M I I I II I I I 1 I I I I II I II 

Db 1607 TGGGGGCAGGCGCCAGGGACGGACGTGGGCCAGTGCGAGCCCAGAGGGCCCGAAGGCCGG 1666 

Qy 62 CCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAACC 121 

|| I Ill I I II I II I I I M t I I I I I I 

Db 1667 GGCCCACCATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGC 1726 

Qy 122 GCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTGG 181

| II IN I I 

Db 1727 CTGCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCC 1786

Qy 182 CGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGC 224 

till III II III II I M M I III 

Db 1787 CCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGG 1846

Qy 225 -CAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGG 283

Ml I I I 1 I I Ml I I I I I I I I I I I I I I I M I 

Db 1847 GCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGG 1906

Qy 284 AGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTA 343

| | 1 | | | | I I I II I I I I I I I I I I I I I I 1 I I I I I I I I I 

Db 1907 AGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTA 1966

Qy 344 ACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 403

I M I I I I I I I I I MM IN IN I I I I M I II I I 1 1 

Db 1967 ACTTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGT 2026 

Qy 404 CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGA 463

Mill I I I M M I I' 'I "I 

Db 2027 CCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGG 2086



Qy 464 CGGGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTG 523

|| IN I I I I I I I I I I I I I I t I i I I I I I I I I I I I

Db 2087 AAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTG 2146

Qy 524 TCAACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATG 583

I I I I 1 I I I I III I M I I I I M I I I I I I I I 1 I I I I I I I I

Db 2147 CCAACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAG 2206

Qy 584 GAATACTTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCT 643

I II II II II II I I I II t I I I I I II! I I I I I I t I MM

Db 2207 GCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCT 2266

Qy 644 TCGACTCCCTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAG 703

I I I I II I II II I II I I I I M M M t M II M I II M I M M I I

Db 2267 TTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTG 2326

Qy 704 CCGGCTTGCCCGT TGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGG 754

| I I I II I I I I I I I I I I I I Mil

Db 2327 CTGGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTG 2386

Qy 755 GTGGAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGT 814

| | | || I I I I I M I I I I 1 II I I I I I I I I I I I I MM

Db 2387 GAGGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGT 2446

Qy 815 GGTACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACT 874

|| M M It I I I I II I I M I 1 I I I I I I M M I I M

Db 2447 GGTATTATGAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACT 2506

Qy 875 GCAGAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGC 934

Ml II M I II I II I I II I I I M I II II I I I I II M M MM III

Db 2507 GCAAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGC 2566

Qy 935 CCCAGAAGGTGTTTGATGCGGTGGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAAT 994

M I I I I i I I II II I I I I M I I I I II II I II

Db 2567 CCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGT 2626

Qy 995 TCTCTGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTT 1054

II M II II II I M I i I II M I I I M I I I M I I M I II I

Db 2627 TCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTT 2686

Qy 1055 GGTCTTACTTCCCTAAAATCTCCATCTACCTGAGAGATGAGAACTCCAGCAGGTCATTCC 1114

|| M I II I M I I II II I I I I I I I II IMI II I I II I

Db 2687 GGAACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCC 2746

Qy 1115 GTATCACAATCCTGCCTCAGCTTTACATTCAGCCCATGATGGGGGCCGGCCTG AATT 1171

I I M II I II II M II I I I II I 1 I I I II I I II I I

Db 2747 GCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAG 2806

Qy 1172 ATGAATGTTACCGATTCGGCATTTCCCCATCCACAAATGCGCTGGTGATCGGTGCCACGG 1231

I II II II II II I II I M I M II I I I M I I I I

Db 2807 ACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTA 2866

Qy 1232 TGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGA 1291

III 1 I I II I I I I II I I I I M II II I I II II I II I I

Db 2867 TCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCA 2926



Qy 1292 GCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTTCCGGGCCTTTCTCAACAG 1351 

Mil! I II III I I t I I I I I II 

Db 2927 GCGCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCT 2986

Qy 1352 AGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAGCCCATTTTGTGGATTG 1411 

Mill I I I I I I I I I I I I II 

Db 2987 TGGACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCA 3046

Qy 1412 TGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTTAATCGTCCTGCTGC 1471

I I I I I I I I ! I I I I I I I I I It I I I I I I I I 

Db 3047 TAGCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGT 3106

Qy 1472 TGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATGATGAGTCCT 1531 

II I II I II IN I I I I I I I ! I I I I I I II 

Db 3107 GTCAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCT 3166 

Qy 1532 CTCTGGTCA 1540

I I II I I 
Db 3167 CCCTGCTGA 3175 



RESULT 15 
US-09-724-566A-1 

Sequence 1, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 



APPLICANT: Anderson, John P.
APPLICANT: Basi, Guriqbal
APPLICANT: Doane, Minh Tam
APPLICANT: Frigon, Normand
APPLICANT: John, Varghese
APPLICANT: Power, Michael
APPLICANT: Sinha, Sukanto
APPLICANT: Tatsuno, Gwen
APPLICANT: Tung, Jay
APPLICANT: Wang, Shuwen
APPLICANT: McConlogue, Lisa
TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and



TITLE OF INVENTION: Methods 
FILE REFERENCE: 228-US-NEWC2

CURRENT APPLICATION NUMBER: US/09/724,566A
CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708
PRIOR FILING DATE: 2000-02-10
PRIOR APPLICATION NUMBER: 60/119,571
PRIOR FILING DATE: 1999-02-10
PRIOR APPLICATION NUMBER: 60/139,172
PRIOR FILING DATE: 1999-06-15
NUMBER OF SEQ ID NOS: 104

SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 1
LENGTH: 1503
TYPE: DNA

ORGANISM: Homo sapiens
US-09-724-566A-1



Query Match 21.9%; Score 395; DB 4; Length 1503;



Best Local Similarity 55.6%; Pred. No. 2.2e-84; 

Matches 805; Conservative 0; Mismatches 630; Indels 12; Gaps 2; 



Qy 106 GCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGC 165 

III I I II I I I I I II III I I III I I I I I 

Db 55 GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 114 

Qy 166 CACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCC 225 

II I I t I I I II I I I I I I I I I II 

Db 115 CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 174

Qy 226 AACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAG 285

I Ml I I f I I I I I i I I I I I I I I III I II II I I I I I I I I I I I I I I I 

Db 175 AGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 234 

Qy 286 ATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAAC 345

III I M I I I I I I I II I I I I I I I I I I I I ! I I I I I I I I I I I I I I I 

Db 235 ATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAAC 294

Qy 346 TTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCT 405

M II I 1 I 1 I I I I I I I II I I I I I t I I 1 I II II ME 

Db 295 TTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCC 354

Qy 406 AGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACG 465

! II I I I I II I I I I I II III I I II I I I I i I III 

Db 355 AGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAA 414 

Qy 466 GGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTC 525

II III II I I I I I I I I I I M I I M I III I I I I M I 

Db 415 GGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCC 474 

Qy 526 AACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGA 585

I I I I I I I I III I I I II I I I I I I I I I M I II I I I I I I I I 

Db 475 AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGC 534

Qy 586 ATACTTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTC 645

II II II II II I I I I I I I I I M III II I II II I I Mill 

Db 535 ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 594 

Qy 646 GACTCCCTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCC 705

I I I II I I I I I I II I M I I I I II I I M I I I I I II I I I I I I I I I I 

Db 595 GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 654 

Qy 706 GGCTTGCCCGT TGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGT 756

I I I I I I I I I II I I I I I I I M I I I Mill 

Db 655 GGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 714

Qy 757 GGAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGG 816

I I I I I I I I II I I I M II I M I I II I I I I I I II I I I I 

Db 715 GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 774

Qy 817 TACTACCAGATAGAAATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGC 876

I I I I III Mill II I I I I MM II II It I I I II I 

Db 775 TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 834

Qy 877 AGAGAGTATAACGCAGACAAGGCCATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCC 936

I Mill Eli i M M I I I I I I M I I I M II II I I M MM I II I I 



Db 835 AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 894



Qy 937 CAGAAGGTGTTTGATGCGGTGGTGGAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTC 996 

I I I I I I I I I I I I II I II III I I I I I I I I I I 

Db 895 AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 954

Qy 997 TCTGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGG 1056

I I I I I I I I I I I I I I II I I I I if I I I I I I I I MINIM 

Db 955 CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 1014 

Qy 1057 TCTTACTTCCCTAAAATCTCCATCTACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGT 1116

! M II Mill I I I M II I I I II I I I I I I I I I I M I 

Db 1015 AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGC 1074

Qy 1117 AT C AC AAT C CT GC CT C AGC TT T AC AT T C AGCC CAT G ATGGGGGCCGGCCTGAATTAT 1173 

I M i I I I I I I I I I I I I II I I I I II ! I I I I I I I I 

Db 1075 ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 1134 

Qy 1174 GAATGTTACCGATTCGGCATTTCCCCATCCACAAATGCGCTGGTGATCGGTGCCACGGTG 1233 

MINIM MINIMI II II I I I I I M I I I 

Db 1135 GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 1194 

Qy 1234 ATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGC 1293 

I I I I I I I I M It I I I II II I I II I I I I I N I I I I I I I I I I III 
Db 1195 ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 1254 

Qy 1294 CCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTTCCGGGCCTTTCTCAACAGAG 1353 

t I I I II I I I I II II II I II I 

Db 1255 GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 1314 

Qy 1354 GATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAGCCCATTTTGTGGATTGTG 1413 

I I I I I I I I I I I I I I I i II I 

Db 1315 GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATA 1374 

Qy 1414 TCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTTAATCGTCCTGCTGCTG 1473 

MINI I I I I II I I I I I I I I I I I I I I I 

Db 1375 GCCTATGTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGT 1434 

Qy 1474 CTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATGATGAGTCCTCT 1533 

II I II I II III I I 1 I I I I I MINI III 

Db 1435 CAGTGGCGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCC 1494 

Qy 1534 CTGGTCA 1540 

III I I 

Db 1495 CTGCTGA 1501 



Search completed: March 6, 2004, 00:33:12 
Job time : 161 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: March 5, 2004, 23:19:02 ; Search time 654 Seconds 

(without alignments) 

10088.696 Million cell updates/sec 

Title: US-09-668-314C-1 

Perfect score: 1804 

Sequence: 1 atgggcgcactggcccgggc aaaccaaaaaaaaaaaaaaa 1804 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 2421054 seqs, 1828716029 residues 

Total number of hits satisfying chosen parameters: 4842108 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA:* 

 1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:* 
 2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:* 
 3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:* 
 4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:* 
 5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:* 
 6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:* 
 7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:* 
 8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:* 
 9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:* 
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:* 
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:* 
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:* 
13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:* 
14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:* 
15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:* 
16: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:* 
17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:* 
18: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:* 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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US-09-978-564A-195   Sequence 195, App
    19   1784.4   98.9   1879  10  US-09-999-833A-195   Sequence 195, App
    20   1784.4   98.9   1879  10  US-09-981-915A-195   Sequence 195, App
    21   1784.4   98.9   1879  10  US-09-978-824-195    Sequence 195, App
    22   1784.4   98.9   1879  10  US-09-918-585A-195   Sequence 195, App
    23   1784.4   98.9   1879  10  US-09-978-423A-195   Sequence 195, App
    24   1784.4   98.9   1879  10  US-09-978-193A-195   Sequence 195, App
    25   1784.4   98.9   1879  10  US-09-999-830A-195   Sequence 195, App
    26   1784.4   98.9   1879  10  US-09-978-757A-195   Sequence 195, App
    27   1784.4   98.9   1879  10  US-09-978-187B-195   Sequence 195, App
    28   1784.4   98.9   1879  10  US-09-978-643A-195   Sequence 195, App
    29   1784.4   98.9   1879  10  US-09-978-375A-195   Sequence 195, App
    30   1784.4   98.9   1879  10  US-09-978-298A-195   Sequence 195, App
    31   1784.4   98.9   1879  10  US-09-978-188A-195   Sequence 195, App
    32   1784.4   98.9   1879  10  US-09-978-681A-195   Sequence 195, App
    33   1784.4   98.9   1879  10  US-09-978-194A-195   Sequence 195, App
    34   1784.4   98.9   1879  10  US-09-999-029A-195   Sequence 195, App
    35   1784.4   98.9   1879  10  US-09-978-299A-195   Sequence 195, App
    36   1784.4   98.9   1879  10  US-09-978-544A-195   Sequence 195, App
    37   1784.4   98.9   1879  10  US-09-978-665A-195   Sequence 195, App
    38   1784.4   98.9   1879  10  US-09-978-802A-195   Sequence 195, App
    39   1784.4   98.9   1879  12  US-10-164-149A-195   Sequence 195, App
    40   1784.4   98.9   1879  12  US-10-206-915-71     Sequence 71, Appl
    41   1784.4   98.9   1879  12  US-10-199-670-71     Sequence 71, Appl
    42   1784.4   98.9   1879  12  US-10-201-858-71     Sequence 71, Appl
    43   1784.4   98.9   1879  13  US-10-052-586-71     Sequence 71, Appl
    44   1784.4   98.9   1879  14  US-10-174-590-71     Sequence 71, Appl
    45   1784.4   98.9   1879  14  US-10-176-758-71     Sequence 71, Appl



ALIGNMENTS 



RESULT 1 
US-09-794-927-1 

; Sequence 1, Application US/09794927 
; Patent No. US20010016324A1 
; GENERAL INFORMATION: 
;  APPLICANT: Gurney, Mark E. 
;  APPLICANT: Bienkowski, Michael J. 
;  APPLICANT: Heinrikson, Robert L. 
;  APPLICANT: Parodi, Luis A. 
;  APPLICANT: Yan, Riqiang 
;  TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND 
;  TITLE OF INVENTION: USES 
;  TITLE OF INVENTION: THEREFOR 
;  FILE REFERENCE: 28341/6280FG 
;  CURRENT APPLICATION NUMBER: US/09/794,927 
;  CURRENT FILING DATE: 2001-02-27 
;  PRIOR APPLICATION NUMBER: 09/416,901 
;  PRIOR FILING DATE: 1999-10-13 
;  PRIOR APPLICATION NUMBER: 60/155,493 
;  PRIOR FILING DATE: 1999-09-23 
;  PRIOR APPLICATION NUMBER: 09/404,133 
;  PRIOR FILING DATE: 1999-09-23 
;  PRIOR APPLICATION NUMBER: PCT/US99/20881 
;  PRIOR FILING DATE: 1999-09-23 
;  PRIOR APPLICATION NUMBER: 60/101,594 
;  PRIOR FILING DATE: 1998-09-24 
;  NUMBER OF SEQ ID NOS: 73 
;  SOFTWARE: PatentIn Ver. 2.0 
; SEQ ID NO 1 
;   LENGTH: 1804 
;   TYPE: DNA 
;   ORGANISM: Homo sapiens 
US-09-794-927-1 

Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy       1801 AAAA 1804
              ||||
Db       1801 AAAA 1804





RESULT 2 
US-09-795-847-1 

; Sequence 1, Application US/09795847 
; Patent No. US20010018208A1 
; GENERAL INFORMATION: 
;  APPLICANT: Gurney, Mark E. 
;  APPLICANT: Bienkowski, Michael J. 
;  APPLICANT: Heinrikson, Robert L. 
;  APPLICANT: Parodi, Luis A. 
;  APPLICANT: Yan, Riqiang 
;  TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND 
;  TITLE OF INVENTION: USES 
;  TITLE OF INVENTION: THEREFOR 
;  FILE REFERENCE: 28341/6280DE 
;  CURRENT APPLICATION NUMBER: US/09/795,847 
;  CURRENT FILING DATE: 2001-02-28 
;  PRIOR APPLICATION NUMBER: 09/416,901 
;  PRIOR FILING DATE: 1999-10-13 
;  PRIOR APPLICATION NUMBER: 60/155,493 
;  PRIOR FILING DATE: 1999-09-23 
;  PRIOR APPLICATION NUMBER: 09/404,133 
;  PRIOR FILING DATE: 1999-09-23 
;  PRIOR APPLICATION NUMBER: PCT/US99/20881 
;  PRIOR FILING DATE: 1999-09-23 
;  PRIOR APPLICATION NUMBER: 60/101,594 
;  PRIOR FILING DATE: 1998-09-24 
;  NUMBER OF SEQ ID NOS: 73 
;  SOFTWARE: PatentIn Ver. 2.0 
; SEQ ID NO 1 
;   LENGTH: 1804 
;   TYPE: DNA 
;   ORGANISM: Homo sapiens 
US-09-795-847-1 



Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy       1801 AAAA 1804
              ||||
Db       1801 AAAA 1804



RESULT 3 
US-09-794-743-1 

; Sequence 1, Application US/09794743 
; Patent No. US20010021391A1 
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APPLICANT 
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APPLICANT 
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; SEQ ID NO 1 

;   LENGTH: 1804 
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Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


i 


ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 

M 1 1 1 1 1 1 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 I I M 1 I ! 1 I 1 M I 1 1 1 I I I I I I I I | | | | M ! 1 1 f 1 1 1 

ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 


60 


Db 


l 


60 


Qy 


61 


GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 

1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 M 1 1 1 1 1 1 

GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 


120 


Db 


61 


120 


Qy 


121 


CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 

1 1 1 M 1 I 1 1 1 1 1 M 1 1 1 1 1 1 1 I I I I M 1 ! 1 1 I I 1 ! 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | | | | | | | 

CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 


180 


Db 


121 


180 


Qy 


181 


GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 

1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 I 1 I 1 t 1 ! 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 ) 1 t 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I M | | M M 1 1 1 1 I 1 1 1 1 M I 1 1 1 1 M 1 1 1 1 II II 1 
GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 


240 


Db 


181 


240 


Qy 


241 


GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 
1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I M 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 I | | | | | | | | | | i | | | | | | | | | | 
GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 


300 


Db 


241 


300 


Qy 


301 


CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 
CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 


360 


Db 


301 


360 


Qy 


361 


AC C C C GCACT C C T AC ATAGACAC GT ACT TT GAC AC AGAGAGGT C TAG CACAT AC C GCT C C 

1 N 1 1 1 1 1 1 1 1 1 1 I I | | I | | | | | | | | | | | | | | | | | t M | | | | | | | | | | | | | | | 

AC C C C G C AC T C CT AC ATAGACAC GT ACT T T GAC AC AGAGAGGT CT AG CACAT AC C GCT C C 


420 


Db 


361 


420 


Qy 


421 


AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 
1 1 N 1 1 1 1 1 I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | f | | | | | | j | | | | | | | | | | | [ | 

AAG GGC T T T GAC GT C ACAGT GAAGT AC ACACAAG GAAG C T G GAC G GGCT T C GT T GG GGAA 


480 


Db 


421 


480 


Qy 


481 


GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 
M 1 M 1 M 1 1 1 1 1 | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | j | | | | | | 

GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 


540 


Db 


481 


540 


Qy 


541 


T T T GAAT C AGAGAAT TTCTTTTTGC C TGG GAT T AAAT GGAAT G GAAT ACT TGGC CT AG C T 
1 1 N 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | M | | | | | | | | | | | 


600 



541 T T T GAAT CAGAGAAT TTCTTTTTGCCT G GGAT T AAAT GGAAT G GAAT AC TT GG C CT AG CT 600 

Qy 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440



Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804





RESULT 4 
US-09-794-748-1 

Sequence 1, Application US/09794748 
Patent No. US20020037315A1
GENERAL INFORMATION: 



APPLICANT: Gurney, Mark E.
APPLICANT: Bienkowski, Michael J.
APPLICANT: Heinrikson, Robert L.
APPLICANT: Parodi, Luis A.
APPLICANT: Yan, Riqiang

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND USES
TITLE OF INVENTION: THEREFOR
FILE REFERENCE: 28341/6280JL 
CURRENT APPLICATION NUMBER: US/09/794,748
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 



NUMBER OF SEQ ID NOS: 73
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 1
LENGTH: 1804
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-794-748-1 

Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy 1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy 61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy 121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy 181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy 241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy 301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy 361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420


Qy 421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy 481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy 541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440


Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804





RESULT 5 
US-09-794-925-1 

Sequence 1, Application US/09794925 
Patent No. US20020064819A1
GENERAL INFORMATION: 



APPLICANT: Gurney, Mark E.
APPLICANT: Bienkowski, Michael J.
APPLICANT: Heinrikson, Robert L.
APPLICANT: Parodi, Luis A.
APPLICANT: Yan, Riqiang

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND USES
TITLE OF INVENTION: THEREFOR
FILE REFERENCE: 28341/6280HI 
CURRENT APPLICATION NUMBER: US/09/794,925
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS: 73
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 1
LENGTH: 1804
TYPE: DNA

ORGANISM: Homo sapiens 
US-09-794-925-1 



Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy 1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy 61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy 121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy 181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy 241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy 301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy 361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy 421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy 481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy 541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780



Qy 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804



RESULT 6 
US-09-681-442-1 

Sequence 1, Application US/09681442 
Patent No. US20020081634A1
GENERAL INFORMATION: 



APPLICANT: Gurney, Mark E.
APPLICANT: Bienkowski, Michael J.
APPLICANT: Heinrikson, Robert L.
APPLICANT: Parodi, Luis A.
APPLICANT: Yan, Riqiang

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND USES
TITLE OF INVENTION: THEREFOR

FILE REFERENCE: 28341/6280FG 

CURRENT APPLICATION NUMBER: US/09/681,442

CURRENT FILING DATE: 2001-04-05 

PRIOR APPLICATION NUMBER: 09/416,901 

PRIOR FILING DATE: 1999-10-13 

PRIOR APPLICATION NUMBER: 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 09/404,133 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/20881

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: 60/101,594 

PRIOR FILING DATE: 1998-09-24 

NUMBER OF SEQ ID NOS: 73

SOFTWARE: PatentIn Ver. 2.0

SEQ ID NO 1 

LENGTH: 1804

TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-681-442-1 



Query Match 100.0%; Score 1804; DB 9; Length 1804; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy 61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy 121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy 181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy 241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy 301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy 361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy 421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy 481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy 541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900


Qy 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 168 0 

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740 

I I I I M I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I | | I | | | | | | I I I I 
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 174 0 

Qy 17 41 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 18 00 

I I M I I I I I I I I I I I I I I I I I I I | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1741 CT C C C TACT T C CAAGAAAAAT AAT TAAAAAAAAAAC T T CAT T CT AAAC CAAAAAAAAAAA 18 00 



Qy 

Db 



1801 AAAA 1804 
I I I I 

1801 AAAA 1804 



RESULT 7 
US-09-869-414-1 

Sequence 1, Application US/09869414
Publication No. US20030077226A1
GENERAL INFORMATION:
APPLICANT: Beinkowski et al.
TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND USES
TITLE OF INVENTION: THEREFOR
FILE REFERENCE: 28341/6280M
CURRENT APPLICATION NUMBER: US/09/869,414
CURRENT FILING DATE: 2001-06-27
PRIOR APPLICATION NUMBER: 09/416,901
PRIOR FILING DATE: 1999-10-13
PRIOR APPLICATION NUMBER: 60/155,493
PRIOR FILING DATE: 1999-09-23
PRIOR APPLICATION NUMBER: 09/404,133
PRIOR FILING DATE: 1999-09-23
PRIOR APPLICATION NUMBER: PCT/US99/20881
PRIOR FILING DATE: 1999-09-23
PRIOR APPLICATION NUMBER: 60/101,594
PRIOR FILING DATE: 1998-09-24
NUMBER OF SEQ ID NOS: 73
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 1
LENGTH: 1804
TYPE: DNA
ORGANISM: Homo sapiens
US-09-869-414-1

Query Match 100.0%; Score 1804; DB 10; Length 1804;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804





RESULT 8 
US-09-548-366-1 

; Sequence 1, Application US/09548366
; Publication No. US20030104365A1
; GENERAL INFORMATION:
; APPLICANT: Gurney, Mark E.
; APPLICANT: Bienkowski, Michael J.
; APPLICANT: Heinrikson, Robert L.
; APPLICANT: Parodi, Luis A.
; APPLICANT: Yan, Riqiang
; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, AND
; TITLE OF INVENTION: USES THEREFOR
; FILE REFERENCE: 28341/6280A
; CURRENT APPLICATION NUMBER: US/09/548,366
; CURRENT FILING DATE: 2000-04-12
; PRIOR APPLICATION NUMBER: 60/155,493
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: 09/404,133
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: PCT/US99/20881
; PRIOR FILING DATE: 1999-09-23
; PRIOR APPLICATION NUMBER: 60/101,594
; PRIOR FILING DATE: 1998-09-24
; NUMBER OF SEQ ID NOS: 65
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 1
; LENGTH: 1804
; TYPE: DNA
; ORGANISM: Homo sapiens
US-09-548-366-1

Query Match 100.0%; Score 1804; DB 10; Length 1804;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1804; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 1800

Qy 1801 AAAA 1804
        ||||
Db 1801 AAAA 1804
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Local 


Similarity 99.9%; Pred. No. 0; 




Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0;


Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879
Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879
Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879
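
In the alignment above, the only row where the query and database lines differ is the one starting at Qy 1081 / Db 1174. A minimal sketch of locating that difference from the two sequence strings follows; the function name `find_mismatches` is hypothetical and is not part of the search software that produced this report.

```python
def find_mismatches(qy_seq, db_seq, qy_start, db_start):
    """Return (query_pos, db_pos, query_base, db_base) for each differing column.

    Assumes an ungapped row, so both strings have the same length and
    column i maps to qy_start + i and db_start + i.
    """
    if len(qy_seq) != len(db_seq):
        raise ValueError("rows must be the same length for an ungapped comparison")
    return [
        (qy_start + i, db_start + i, q, d)
        for i, (q, d) in enumerate(zip(qy_seq, db_seq))
        if q != d
    ]


# Row Qy 1081-1140 / Db 1174-1233, copied from the alignment above.
qy_row = "TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC"
db_row = "TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC"

mismatches = find_mismatches(qy_row, db_row, 1081, 1174)
print(mismatches)
```

Under these assumptions the row yields a single T/C difference at query position 1092 (database position 1185).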
Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879





RESULT 13 
US-09-978-189-195 

Sequence 195, Application US/09978189 
Publication No. US20030004102A1
GENERAL INFORMATION: 
APPLICANT: Gerber, Hanspeter
APPLICANT: Gerritsen, Mary E.
APPLICANT: Goddard, Audrey
APPLICANT: Godowski, Paul J.
APPLICANT: Grimaldi, J. Christopher
APPLICANT: Gurney, Austin L.
APPLICANT: Hillan, Kenneth J
APPLICANT: Kljavin, Ivar J.
APPLICANT: Kuo, Sophia S.
APPLICANT: Napier, Mary A.
APPLICANT: Pan, James
APPLICANT: Paoni, Nicholas F.
APPLICANT: Roy, Margaret Ann
APPLICANT: Shelton, David L.
APPLICANT: Stewart, Timothy A.
APPLICANT: Tumas, Daniel
APPLICANT: Williams, P. Mickey
APPLICANT: Wood, William I.
TITLE OF INVENTION: Secreted and Transmembrane Polypeptides
TITLE OF INVENTION: Acids Encoding the Same
FILE REFERENCE: P2630P1C7 

CURRENT APPLICATION NUMBER: US/09/978,189
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585
PRIOR FILING DATE: 2001-07-30
PRIOR APPLICATION NUMBER: 60/062250
PRIOR FILING DATE: 1997-10-17
PRIOR APPLICATION NUMBER: 60/064249
PRIOR FILING DATE: 1997-11-03
PRIOR APPLICATION NUMBER: 60/065311
PRIOR FILING DATE: 1997-11-13
PRIOR APPLICATION NUMBER: 60/066364
PRIOR FILING DATE: 1997-11-21
PRIOR APPLICATION NUMBER: 60/077450
PRIOR FILING DATE: 1998-03-10
PRIOR APPLICATION NUMBER: 60/077632
PRIOR FILING DATE: 1998-03-11
PRIOR APPLICATION NUMBER: 60/077641
PRIOR FILING DATE: 1998-03-11
PRIOR APPLICATION NUMBER: 60/077649
PRIOR FILING DATE: 1998-03-11
PRIOR APPLICATION NUMBER: 60/077791
PRIOR FILING DATE: 1998-03-12
PRIOR APPLICATION NUMBER: 60/078004
PRIOR FILING DATE: 1998-03-13
PRIOR APPLICATION NUMBER: 60/078886
PRIOR FILING DATE: 1998-03-20
PRIOR APPLICATION NUMBER: 60/078936
PRIOR FILING DATE: 1998-03-20
PRIOR APPLICATION NUMBER: 60/078910
PRIOR FILING DATE: 1998-03-20
PRIOR APPLICATION NUMBER: 60/078939
PRIOR FILING DATE: 1998-03-20
PRIOR APPLICATION NUMBER: 60/079294
PRIOR FILING DATE: 1998-03-25
PRIOR APPLICATION NUMBER: 60/079656
PRIOR FILING DATE: 1998-03-26



PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
; PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079663 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079786 
PRIOR FILING DATE: 1998-03-27 
; PRIOR APPLICATION NUMBER: 60/079920 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/079923 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/080105 
; PRIOR FILING DATE: 1998-03-31 

PRIOR APPLICATION NUMBER: 60/080107 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080327 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080333 
; PRIOR FILING DATE: 1998-04-01 

PRIOR APPLICATION NUMBER: 60/080334 
; PRIOR FILING DATE: 1998-04-01 

PRIOR APPLICATION NUMBER: 60/081070 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081049 
; PRIOR FILING DATE: 1998-04-08 

PRIOR APPLICATION NUMBER: 60/081071 
; PRIOR FILING DATE: 1998-04-08 

PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
; PRIOR FILING DATE: 1998-04-15 

PRIOR APPLICATION NUMBER: 60/081838 
; PRIOR FILING DATE: 1998-04-15 

PRIOR APPLICATION NUMBER: 60/082568 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082569 
PRIOR FILING DATE: 1998-04-21 
; PRIOR APPLICATION NUMBER: 60/082704 



PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082804 
; PRIOR FILING DATE: 1998-04-22 

PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
; PRIOR APPLICATION NUMBER: 60/082797 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082796
PRIOR FILING DATE: 1998-04-23 
PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
PRIOR FILING DATE: 1998-04-29 
; PRIOR APPLICATION NUMBER: 60/083495 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
; PRIOR FILING DATE: 1998-04-29 

PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083554 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083559 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083500 
; PRIOR FILING DATE: 1998-04-29 

PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 
PRIOR APPLICATION NUMBER: 60/084366 
PRIOR FILING DATE: 1998-05-05 
PRIOR APPLICATION NUMBER: 60/084414 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084441 
PRIOR FILING DATE: 1998-05-06 
PRIOR APPLICATION NUMBER: 60/084637 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084639 
PRIOR FILING DATE: 1998-05-07 
; PRIOR APPLICATION NUMBER: 60/084640 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084598 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084600 
; PRIOR FILING DATE: 1998-05-07
; PRIOR APPLICATION NUMBER: 60/084627 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/085339 
; PRIOR FILING DATE: 1998-05-13 

PRIOR APPLICATION NUMBER: 60/085338 
PRIOR FILING DATE: 1998-05-13 



PRIOR APPLICATION NUMBER: 60/085323
PRIOR FILING DATE: 1998-05-13
PRIOR APPLICATION NUMBER: 60/085582
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085700
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085689
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085579
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085580
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085573
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085704
PRIOR FILING DATE: 1998-05-15
PRIOR APPLICATION NUMBER: 60/085697



Query Match 98.9%; Score 1784.4; DB 10; Length 1879; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
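
The percentages in this summary appear consistent with two simple ratios. The sketch below is an illustration only, under two assumptions that the report itself does not state: that Query Match is the hit score divided by the perfect score (1804, from the search header), and that Best Local Similarity is the match count divided by the number of aligned columns. The variable names are hypothetical.

```python
# Values copied from the report: search header and RESULT 13 summary lines.
perfect_score = 1804.0   # "Perfect score" in the search header
score = 1784.4           # "Score" for this hit
matches = 1785
mismatches = 1
indels = 0

# Assumption: aligned columns = matches + mismatches + indels.
aligned_columns = matches + mismatches + indels

# Assumption: Query Match = score / perfect score.
query_match_pct = 100.0 * score / perfect_score

# Assumption: Best Local Similarity = matches / aligned columns.
best_local_similarity_pct = 100.0 * matches / aligned_columns

print(f"Query Match           {query_match_pct:.1f}%")
print(f"Best Local Similarity {best_local_similarity_pct:.1f}%")
```

Rounded to one decimal place, both ratios agree with the 98.9% and 99.9% figures printed above, which supports the assumed definitions without confirming them.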



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879
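
The single mismatch reported for this hit can be located by comparing the Qy and Db rows column by column. The sketch below is a minimal illustration, assuming an ungapped alignment (the listing reports Indels 0 and Gaps 0); the function name `find_mismatches` is hypothetical and not part of the search tool. The two strings are the Qy 1081 and Db 1174 rows from the alignment above.

```python
def find_mismatches(qy_start: int, db_start: int, qy_seq: str, db_seq: str):
    """Return (query_pos, db_pos, query_base, db_base) for each differing column.

    Assumes both rows are ungapped and cover the same aligned columns.
    """
    if len(qy_seq) != len(db_seq):
        raise ValueError("aligned rows must have equal length")
    return [
        (qy_start + i, db_start + i, q, d)
        for i, (q, d) in enumerate(zip(qy_seq, db_seq))
        if q != d
    ]


# Row pair starting at Qy 1081 / Db 1174.
qy = "TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC"
db = "TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC"
mismatches = find_mismatches(1081, 1174, qy, db)
print(mismatches)  # [(1092, 1185, 'T', 'C')]
```

For a gapped alignment the position arithmetic would need to skip gap characters, which this sketch does not attempt.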



RESULT 14
US-09-978-608A-195
; Sequence 195, Application US/09978608A
; Publication No. US20030045462A1
; GENERAL INFORMATION:
; APPLICANT: Ashkenazi, Avi
; APPLICANT: Baker Kevin P.
; APPLICANT: Botstein, David
; APPLICANT: Desnoyers, Luc
; APPLICANT: Eaton, Dan
; APPLICANT: Ferrara, Napoleon
; APPLICANT: Filvaroff, Ellen
; APPLICANT: Fong, Sherman
; APPLICANT: Gao, Wei-Qiang
; APPLICANT: Gerber, Hanspeter
; APPLICANT: Gerritsen, Mary E.
; APPLICANT: Goddard, Audrey
; APPLICANT: Godowski, Paul J.
; APPLICANT: Grimaldi, J. Christopher
; APPLICANT: Gurney, Austin L.
; APPLICANT: Hillan, Kenneth J
; APPLICANT: Kljavin, Ivar J.
; APPLICANT: Kuo, Sophia S.
; APPLICANT: Napier, Mary A.
; APPLICANT: Pan, James;
; APPLICANT: Paoni, Nicholas F.
; APPLICANT: Roy, Margaret Ann
; APPLICANT: Shelton, David L.
; APPLICANT: Stewart, Timothy A.
; APPLICANT: Tumas, Daniel
; APPLICANT: Williams, P. Mickey
; APPLICANT: Wood, William I.
; TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic
; TITLE OF INVENTION: Acids Encoding the Same
; FILE REFERENCE: P2630P1C22
; CURRENT APPLICATION NUMBER: US/09/978,608A
; CURRENT FILING DATE: 2001-10-16
; NUMBER OF SEQ ID NOS: 624
; Prior Application removed - See File Wrapper or Palm
; SEQ ID NO 195
; LENGTH: 1879
; TYPE: DNA
; ORGANISM: Homo sapien
US-09-978-608A-195



Query Match 98.9%; Score 1784.4; DB 10; Length 1879; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879



RESULT 15
US-09-978-585A-195
; Sequence 195, Application US/09978585A
; Publication No. US20030049633A1
; GENERAL INFORMATION:
; APPLICANT: Ashkenazi, Avi
; APPLICANT: Baker Kevin P.
; APPLICANT: Botstein, David
; APPLICANT: Desnoyers, Luc
; APPLICANT: Eaton, Dan
; APPLICANT: Ferrara, Napoleon
; APPLICANT: Filvaroff, Ellen
; APPLICANT: Fong, Sherman
; APPLICANT: Gao, Wei-Qiang
; APPLICANT: Gerber, Hanspeter
; APPLICANT: Gerritsen, Mary E.
; APPLICANT: Goddard, Audrey
; APPLICANT: Godowski, Paul J.
; APPLICANT: Grimaldi, J. Christopher
; APPLICANT: Gurney, Austin L.
; APPLICANT: Hillan, Kenneth J
; APPLICANT: Kljavin, Ivar J.
; APPLICANT: Kuo, Sophia S.
; APPLICANT: Napier, Mary A.
; APPLICANT: Pan, James;
; APPLICANT: Paoni, Nicholas F.
; APPLICANT: Roy, Margaret Ann
; APPLICANT: Shelton, David L.
; APPLICANT: Stewart, Timothy A.
; APPLICANT: Tumas, Daniel
; APPLICANT: Williams, P. Mickey
; APPLICANT: Wood, William I.
; TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic
; TITLE OF INVENTION: Acids Encoding the Same
; FILE REFERENCE: P2630P1C15
; CURRENT APPLICATION NUMBER: US/09/978,585A
; CURRENT FILING DATE: 2001-10-16
; NUMBER OF SEQ ID NOS: 624
; Prior Application removed - See File Wrapper or Palm
; SEQ ID NO 195
; LENGTH: 1879
; TYPE: DNA
; ORGANISM: Homo sapien
US-09-978-585A-195

Query Match 98.9%; Score 1784.4; DB 10; Length 1879; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1785; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   94 ATGGGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 153

Qy   61 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  154 GCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAAC 213

Qy  121 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  214 CGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTG 273

Qy  181 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  274 GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 333

Qy  241 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  334 GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 393

Qy  301 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  394 CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGA 453

Qy  361 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  454 ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 513

Qy  421 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  514 AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 573

Qy  481 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  574 GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 633

Qy  541 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  634 TTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 693

Qy  601 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  694 TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 753

Qy  661 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  754 CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCT 813

Qy  721 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  814 GGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGTATAAA 873

Qy  781 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  874 GGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 933

Qy  841 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  934 TTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 993

Qy  901 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  994 ATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 1053

Qy  961 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1054 GAAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCC 1113

Qy 1021 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1114 CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATC 1173

Qy 1081 TACCTGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1140
        ||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db 1174 TACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTAC 1233

Qy 1141 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1234 ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCA 1293

Qy 1201 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1294 TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGAC 1353

Qy 1261 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1354 AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 1413

Qy 1321 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1414 GTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCT 1473

Qy 1381 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1474 CAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 1533

Qy 1441 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1534 GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCC 1593

Qy 1501 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1594 CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 1653

Qy 1561 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1654 GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 1713

Qy 1621 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1714 AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 1773

Qy 1681 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1774 GCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATC 1833

Qy 1741 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1786
        ||||||||||||||||||||||||||||||||||||||||||||||
Db 1834 CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAA 1879



Search completed: March 6, 2004, 02:17:32 
Job time : 662 secs



